WO 01/23604 PCT/CAOO/01150 

CGGTTCCGAG GTGTCTGCCC TGCTGGGTCG 6 ' 1M JJttteTCT* GCCGTlBGGTT 4 0^ 

ACCAGCCCAC CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 4 50 

ACCACCAAGA AGGGCTCTAT CACCTCCG 478 



2) INFORMATION FOR SEQ ID NO: 892 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:- Penicillium marneffei 

(B) STRAIN: ATCC 58950 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892 



GTCTTTATCC AGGAGTTGAT TGTACGTCTT TACCTTTCTG CCTGACTGTT 50 

TACGACAACT AACGAAAGCG TAGAACAACA TTGCCAAGGC TCACGGTGGT 100 

25 TACTCTGTCT TCACTGGTGT CGGTGAACGT ACTCGTGAGG GTAACGATTT 150 

GTACCACGAA ATGCAGGAAA CTGGTGTCAT TCAGCTCGAG GGTGAATCCA 200 

AGGTCGCCCT CGTGTTCGGT CAGATGAACG AGCCCCCCGG TGCCCGTGCC 250 

CGTGTCGCTC TTACTGGTTT GACCATTGCC GAGTACTTCC GTGACGAGGA 300 

AGGTCAGGAC GTGCTTCTCT TCATTGACAA CATTTTCCGT TTCACTCAGG 350 

3 0 CCGGTTCTGA GGTGTCTGCC CTTCTGGGTC GTATCCCCTC TGCCGTCGGT 4 00 

TACCAGCCCA CCCTTGCCGT CGACATGGGT ATCATGCAGG AGCGTATTAC 450 

CACCACCACC AAGGGTTCCA TCACCTCCGT C 481 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 893 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1208 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893 

CTCGTCCGTG GTGCTAAGGC CACTGACACT GGTTSCCCCA TTACCATCCS 50 

CGTCGGCCCC GGTACCCTCG GTCGCATCAT GAACGTCACC GGTGACCCGA . 100 

TCGACGAGCG CGGTCCCATC AAGACCGACA AGTTCCGTCC CATCCACGCT 150 

55 GAGGCTCCCG AGTTCGTTGA CCAGTCGACC ACCGCTGAGG TTCTCGTGAC 200 

TGGTATCAAG GTCGTCGATC TGCTTGCTCC CTACGCCCGT GGTGGTAAGA 250 

TTGGTCTGTT TGGCGGTGCC GGTGTTGGCA AGACCGTGTT CATCCAGGAG 300 

CTCATCAACA ACATCGCCAA GGCCCACGGT GGTTACTCCG TCTTCACCGG 350 

TGTCGGCGAG CGTACCCGTG AGGGTAACGA TCTGTACCAC GAAATGCAGG 4 00 

60 AGACCTCTGT CATTCAGCTT GACGGTGACT CCAAGGTCGC CCTGGTGTTC 450 

501 
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GGTCAGATGA ACGAGCCCCC TGGTGCTCGT 
TTTGACTGTC GCTGAGTACT TCCGTGACGA 
TTTGAATTAT TTCCTTGTCG TACAGTTCCA 
GTCAGTGCTT CTCTTCATCG ACAACATTTT 
5 CTGAGGTGTC TGCCCTTCTG GGTCGTATTC 
CCCACGCTCG CCGTGGACAT GGGTCTGATG 
CCGCAAGGGC TCAATTACCT CCGTCCAGGC 
ATCTGACGGA TCCCGCCCCC GCCACCACCT 
ACTGTGCTGT CCCGAGGTAT CTCTGAGCTG 

10 CCCCCTCGAC TCCAAGTCGC GTATGCTGGA 
ACCACTACGA GACCGCCACT CGCGTCCAGC 
TCGCTGCAGG ACATCATCGC CATTCTGGGT 
CGACAAGCTT ACAGTCGAGC GTGCTCGTAA 
AGCCGTTCAC GGTCGCGCAG GTCTTCACTG 

15 GATCTGAAGG ACACTATCGC TTCGTTCAAG, 
TGACAGCC 



PCT/CAOO/01150 

GCCCGTGTCG CCTTGAC&GG 500 

GGAGGGCCAG GATGGTATGT 550 

AATCGAAGAA TTACTAACTT 600 

CCGCTTCACC CAGGCCGGTT 650 

CCTCCGCTGT CGGTTACCAG 700 

CAGGAGCGTA TTACCACCAC 750 

CGTCTACGTG CCCGCTGACG 800 

TCGCCCATCT GGACGCCACC 850 

GGTATCTACC CCGCTGTCGA 900 

CCCCCGTATT GTCGGTGACG 950 

AGATCCTCCA AGAGTACAAG 1000 

ATGGACGAGC TGTCTGAGGC 1050 

GATCCAGCGT TTCCTGAGCC 1100 

GTATCGAAGG CCAGCTGGTC 1150 

GCTATCCTGA GCGGTGAGGG 1200 

1208 



20 2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894 



TTCAGGAACT TATTGTAAGC CGCCCTCTTT ATGCATTGAG GGTGAATAAG 50 

AAGGCTGACA GGTAATAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC 100 

CGTGTTCACT GGTGTCGGTG AGCGTACCCG TGAAGGAAAC GATCTGTACC 150 

ATGAAATGCA GGAAACCCGC GTCATCCAGC TTGATGGCGA GTCTAAGGTC 200 

40 GCCCTTGTGT TCGGTCAGAT GAACGAGCCC CCTGGAGCCC GTGCCCGTGT 250 

CGCTCTTACT GGTCTTACCG TTGCCGAATA CTTCCGTGAC GAGGAGGGCC 300 

AAGATGGTAC GCCTTTTTAC TCTTCTTATT CTTCGGGTCG GACTACAGAA 350 

CTAACCTGCT CCAGTGCTTC TCTTCATTGA TAACATTTTC CGTTTCACAC 4 00 

AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG GACGTATTCC CTCTGCCGTC 4 50 

4 5 GGTTACCAGC CCACTCTCGC CGTCGACATG GGTGGTATGC AGGAACGTAT 500 

CACAACCACC AACAAGGGTT CCATTACTTC CGTG 534 



50 2) INFORMATION FOR SEQ ID NO: 895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

502 
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(A) ORGANISM: Paeci 7o m „„ t, ■ PCT/CA00/01150 

(B) STRAIN : ATCC 60735 lllaClnuS 

s (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895 

GTS C G S A C c T TGGTGTCGGT GAGCGTACCC 50 
CTCGAGGGCG AGTCTAAGGT SS GGTCATTCAG 

CCCGGGTGCT CGTGCCCGTG TCGCrrrrln J™ TCAGA TGA ACGAGCC 150 

0 ACTTCCGTGA CCAGGAGGCT rS^SP C TGGTG TTACC GTCGCCGAGT 200 

GAAACATGTG CgScGA gS u A ?I TCTCGTC ^ATcS 250 

CATGGGTGGC ATGcS SSSS? S£2E SgggSg^ £ 

2) INFORMATION FOR SEQ ID NO: 896 

(i) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 483 bases ' 
B TYPE: Nucleic acid 
(C STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE- 

(A) ORGANISM: Aspergillus nicer 

(B) STRAIN: ATCC 9508 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 

a T c G S? gg°S£ JESS?* GATTTGGTGT « 

TGGTTACTCC GTCTTCTGTG GTGTcrrTra ^$ TCGCCA AGGCTCACGG i 00 

ATCTGTACCA CGAAAtSSg gIgIS? rrHrrl^ GAGGG TAACG 150 

TCCAAGGTCG CTCTGGTCTT CGGTCAGATC XSXSS?' GG AGGGTGAC 200 

TGCCCGTGTC GCCCTTACCG rT^SS A ^ AACQAGCCCC CGGGTGCCCG 250 

AGGAGGGTCA oSSSS SSSSSJ SSSS^ TTCGG T G ACG 30 o 

CAGGCCGGTT CTGAGGTGTC CCGTTTCACC 3 50 

CGGTTACCAG CCCACTCTGG CCGTrAarS ™ GTATCC CC TCTGCCGT 400 

TTACCACCAC CACCaISg? 1^5 GGGTGGTA TG CAGGAGCGTA 450 

483 

2) INFORMATION FOR SEQ ID NO: 897 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 1124 bases ' 
B TYPE: Nucleic acid 
C STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE- 

Si SSSn&TKli 1 " 

503 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 



PCT/CAOO/01150 



AAGCTCAAGT 
GAAGTTCCAG 
5 CGCCCCGATA 
GGTCACCGTG 
CTGCGCTATC 
TCTCCAAGGA 
GGTGTCAAGC 

10 GTCCGAGGAT 
AGAAGGTCGG 
TTCAACGGTG 
GGGATGGGAG 
TCGAGGCCAT 

15 CTCCGTCTTC 
CCCTGTCGGC 
CCTTCGCCCC 
CACCAGCAGC 
CAAGAACGTT 

20 CCAAGAACGA 
GTCCTCAACC 
CTGCCACACT 
TTGACCGCCG 
TCCGGTGATG 

25 CGAGTCCTTC 



CCGAGCGTGA 
ACTCCCAAGT 
TGTTTTGGTG 
ACTTCATCAA 
CTCATCATTG 
TGGCCAGACC 
AGCTCATCGT 
CGTTACAACG 
CTACAACCCC 
ACAACATGCT 
AAGGAGACCA 
CGACGCCATT 
CCCTCCAGGA 
CGTGTCGAGA 
CGCCAACGTC 
TCCAGGAGGG 
TCCGTCAAGG 
TCCCCCTCAG 
ACCCCGGTCA 
GCCCACATTG 
TACCGGCAAG 
CCGCCATCGT 
ACTGACTACC 



GCGTGGTATC 
ATGAGGTCAC 
CTGTAGCTAA 
GAACATGATC 
CCTCCGGTAC 
CGTGAGCACG 
CGCCCTCAAC 
AAATTGTCAA 
AAGGCCGTTC 
TGAGCCCTCC 
AGGCCGGCAA 
GAGCCCCCTG 
TGTCTACAAG, 
CCGGTATCAT 
ACCACTGAAG 
TGTCCCCGGT 
AAGTCCGCCG 
GGTGCTGCCT 
GGTCGGCGCT 
CTTGCAAGTT 
TCTGTTGAGA 
GAAGATGGTT 
CCCC 



ACCATCGACA 
TGTCATCGGT 
CACGATCTGA 
ACTGGTACCT 
TGGTGAGTTC 
CTCTGCTGGC 
AAGATGGACA 
GGAAACCTCC 
CCTTCGTCCC 
TCCAACTGCC 
GGTCACTGGT 
TCCGTCCCTY 
ATCTCTGGTA 
CAAGCCCGGC 
TCAAGTCCGT 
GACAACGTCG 
TGGTAACGTC 
CCTTCAACGC 
GGTTACGCCC 
CTCTGAGCTG 
ACAACCCCAA 
CCTTCCAAGC 



TTGCCCTCTG 50 

AAGCTCGACT 100 

AGATGCCCCC 150 

CCCAGGCTGA 200 

GAGGCTGGTA 250 

TTTCACCCTC 300 

CCTGCAAGTG 350 

AACTTCATCA 4 00 

CATCTCTGGC 450 

CCTGGTACAA 500 

AAGACCCTCA 550 

CAACAAGCCC 600 

TCGGAACGGT 650 

ATGGTCGTCA 700 

CGAGATGCAC 750 

GTTTCAACGT 800 

TGCGGTGACT 850 

CCAGGTCATC 900 

CCGTCCTCGA 950 

CTTGAGAAGA 1000 

GTTCATCAAG 1050 

CCATGTGTGT 1100 
1124 



2) INFORMATION FOR SEQ ID NO: 898 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 1363 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: PenicilJium marneffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898 

45 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT .150 

ATTCTCATCA TTGCCTCTGG TACTGGTGAA TTCGAGGCTG GTATCTCCAA 2 00 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

50 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 300 

GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 400 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 450 

CGGTGACAAC ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

55 GGGAGAAGGA GACCAAGGCC GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCCTCGACT ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 700 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

60 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 800 

504 
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TGTGCTAACT TGACACTCAG CGCTCCCGCC 

GAGTGTTGAA ATGCACCACC AGCAGCTCAC 

ACGTTGGTTT CAACGTGAAG AACGTCTCCG 

AACGTTGCTG GTGACAGCAA GAACGACCCC 

5 CAACGCCCAG GTCATCGTCC TCAACCACCC 

ACGCCCCAGT CCTCGATTGC CACACTGCCC 

GAGCTCCTCG AGAAGATTGA CCGTCGTACC 

CCCCAAGTTC ATCAAGTCCG GTGACGCTGC 

CCAAGCCCAT GTGTGTTGAG GCTTTCACCG 

10 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG 

CCTTCTCCTC CGTCTTCCAT ATATATTTTT 

ACAAATCACG GGA 



PCT/CA00/01150 

AACGTCACCA CTGAAGTCAA 850 

TGCCGGTCAG CCCGGTGACA 900 

TCAAGGAAAT CCGTCGTGGT 950 

CCTGCCGGTG CTGCCTCCTT 1000 

CGGTCAGGTC GGTGCTGGTT 1050 

ACATTGCTTG CAAGTTCGCT 1100 

GGAAAGTCTG TTGAGGACCA 1150 

CATCGTCAAG ATGATTCCTT- 1200 

AGTACCCTCC TCTCGGTCGT 1250 

TTGTCTATTT TCCATCCTTC 1300 

TCAGTTATAT GTGACTAACC 1350 

1363 
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2) INFORMATION FOR SEQ ID NO: 899 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 114 7 bases ' 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899 

CAAGCTGAAA GCCGAGCGTG AGCGTGGTAT CACTATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGCAT 100 

TATCTCACTC CTCACAGAAG CACGCTCCTA ACATCACACA GACGCTCCCG 150 

35 GTCACCGTGA TTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATCC TCATTATCGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 2 50 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCC TACACCCTCG 300 

GTGTGCGTCA GCTCATCGTC GCCATCAACA AGATGGACAC CACCAAGTGG 350 

TCTGAGGCCC GTTACCAGGA GATCATCAAG GAGACCTCCA ACTTCATCAA 400 

40 GAAGGTCGGC TACAACCCCA AGACCGTCGC TTTCGTCCCC ATCTCTGGCT 450 

TCAACGGCGA CAACATGCTT GCCCCCTCCA CCAACTGCCC CTGGTACAAG 500 

GGATGGGAGC GTGAGGTCAA GGGCAACAAG CAGACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGGCATTG AGCCCCCCAA GCGTCCTTCC GACAAGCCCC 600 

TCCGTCTTCC TCTCCAGGAT GTTTACAAGA TCGGTGGTAT CGGAACTGTT 650 

45 CCTGTCGGCC GTATCGAGAC TGGTGTCCTC AAGCCCGGTA TGGTCGTTAC 700 

CTTCGCTCCC GCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT CACTGAGGGT CTTCCCGGTG ACAACGTCGG TTTCAACGTG 800 

AAGAACGTTT CCGTCAAGGA CATCCGCCGT GGTAACGTTG CCAGTGACTC 850 

CAAGAACGAC CCCGCTCTGG GTGCCGCTTC TTTCGACGCC CAGGTCATCG 900 

50 TCCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC GGTCCTCGAT 950 

TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGATCA AGGAGAAGAT 1000 

CGACCGCCGT ACCGGCAAGT CTGTTGAGGA CGCCCCCAAG TTCATCAAGT 1050 

CTGGTGACTC TGCCATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTT 1100 

GAGGCTTTCA CCGACTACCC TCCTCTGGGC CGTTTCGCCG TCCGTGA 1147 

55 



2) INFORMATION FOR SEQ ID NO: 900 

60 (i) SEQUENCE CHARACTERISTICS: 

505 
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PCT/CA00/01150 



(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCAAGTACT ATGTCACCGT, CATTGGTACG TCGACTCGCG 100 

CGAGACTGGT CGCAATTTCC ACGTCGCTAA CGTGCTTGAA CAGACGCTCC 150 

CGGCCACCGT GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG 200 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 300 

CGGTGTTAAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 450 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGCTGGCA AGTCCACCGG CAAGACCCTC 550 

CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT ATCGGCACAG 650 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 70 0 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTCCG TTGAGATGCA 750 

CCACGAGCAG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 80 0 

TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 85 0 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGCCGAGAT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



2) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

506 
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ACCGATAAAC 
TATTGGCACT 
GTATGGTCGT 
GTCGAAATGC 
CGGCTTCAAC 
TTGCCGGTGA 
GCCCAGGTCA 
CCCAGTCCTC 
TCCTTGAGAA 
AAGTTCATCA 
G 



CTCTCCGTCT 
GTTCCTGTCG 
GACCTTCGCT 
ACCACCAGCA 
GTCAAGAATG 
CTCTAAGAAT 
TCGTCCTCAA 
GACTGCCATA 
GATTGATCGA 
AGTCCGGTGA 



TCCCCTCCAG 
GACGTGTTGA 
CCCGCCAACG 
GCTTACCGCC 
TTTCCGTCAA 
GATCCCCCAA 
CCACCCTGGT 
CTGCCCACAT 
CGAACCGGAA 
TGCTGCTATC 



GATGTCTACA AGATCTC&3G 
GACTGGAGTC ATCAAGCCCG 
TCACCACTGA AGTCAAGTCC 
GGTAACCCCG GTGACAACGT 
AGAAGTCCGC CGTGGTAACG 
AGGGCTGCGA TTCCTTCAAT 
CAGGTTGGCG CTGGTTATGC 
TGCCTGCAAA TTCGCTGAGC 
AGTCTGTTGA GAACAACCCC 
GTCAAGATGA TTCCTTCCAA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
751 



10 



15 



20 



25 



30 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902 

CGTGAGCGCG GTATCACCAT CGATATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 200 
TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG . 250 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 300 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 350 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 450 

AAGAGCGGCA AGGTTAC CGG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAGATCCGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 900 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 1000 

TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 

CCCCCT 1056 



2) INFORMATION FOR SEQ ID NO: 903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 

CAAGGCTGAG CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT 50 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

TCTCATCATT GCCTCTGGTA CTGGTGAATT, CGAGGCTGGT ATCTCCAAGG 2 00 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 300 

TGAGTACTCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 350 

GATCGTTACA ACGAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 4 00 

CGGATACAAC CCTAAGAACG TTCCTTTCGT TCCTATCTCC GGTTTCAACG 450 

GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG 500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 60 0 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 650 

AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 700 

ACGGTTCCCG TCGGTCGTGT TGAGACTGGT ACCATCGTTC CTGGTATGGT 750 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 850 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 900 

GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 100 0 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 1050 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGCTTGCA AGTTCGCTGA 110 0 

GCTCCTCGAG AAGATTGACC GTCGTACCGG AAAGTCTGTT GAGGACCACC 1150 

CCAAGTTCAT CAAGTCCGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 1200 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 1250 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 1300 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GACTAACCAC 1350 

AAATCACGGG AATAGC 1366 



2) INFORMATION FOR SEQ ID NO: 904 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

* (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 

CTATTATCGT TGTTGCCGCT TCCGACGGTC AAATGCCCCA GACTCGTGAG 50 
CATCTGCTGC TCGCCCGCCA GGTCGGTGTC CAGAAGATCG TTGTCTTCGT 100 

508 
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CAACAAGGTC 
TGGAGATGCG 
CCCATCATCA 
GATTGGTGTC 
5 TCCCCACCCC 
GATGTCTTCT 
GCGAGGTATC 
CCGCCCCCAT 
TGCGAGGAGT 

10 CAAGCGTGAT 
TCAAGGCGCA 
GAAGGTGGTC 
CATCCGCACT 
AGGACGCCCA 

15 TGCGAGCTCC 



GATGCTGTTG 
CGAATTGCTC 
TGGGATCTGC 
AACCGAATTG 
TCAGCGTGAG 
CCATTGCTGG 
CTGAAGCGCG 
CAAGACCAAG 
CTCGCGCTGG 
GAAGTCCGCC 
CAAGAAGTTC 
GTCACACTGG 
GCCGACGAGT 
TGACAAGCTT 
ATGCACCACA 



AGGACAAGGA 
AGCAGCTACG 
CCTCTGCGCC 
ATGAGCTGCT 
ACCGACAAGC 
TCGTGGCACT 
ATGCTGAAGT 
GTTACCGATA 
TGACAACTCC 
GTGGTATGGT 
CTTGTCTCCA 
CTTCGGTGAG 
CGTGTGCCCT 
GTTATGCCCG 
CGTCTTGGAG 



GATGTTGGAG 
GCTTCGAGGG 
ATTGAGGGCC 
CGAGGCCGTT 
CTTTCCTCAT 
GTCGTCTCTG 
TGAGCTCGTC 
TCGAGACCTT 
GGTCTCCTTC 
CGTTTCCGTC 
TGTACGTGTT 
AACTACAGGC 
GTACTGGCCA 
GTGACAACGT 
CCTGGTCAAC 



PCT/CAOO/01150 

CTCGTCGAGA 150 

CGACGAGACT 200 

GCGAACCTGA 250 

GATACTTGGA 300 

GGCCGTCGAG 350 

GCCGTGTCGA 400 

GGCAAGGGCA 450 

CAAGAAGTCC 500 

TTCGTGGTGT 550 

CCTGGACAGG 600 

GAGCAAGGAG 650 

CGCAAATGTT 700 

GAAGGCACCG 750 

CGAGATGGTT 800 

G 841 



2) INFORMATION FOR SEQ ID NO; 905 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 

35 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 2 00 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 250 

40 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 3 00 

. GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 350 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 40 0 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 450 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

45 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGCCAACAG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 750 

50 TACCGTCCCC AGCTGTTCAT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 800 

ATTCCGCTAC TAGGGAACCA TCTCTAATTC TATTTGCTAC AGATGAGGCT 850 

GCTGAGTTCA GCTTCCCCGA CGGAGACCAG TCCCGCCGTA TCATGCCCGG 900 

TGACAACGTC GAGATGATCG TCAAGACCCA CCGCCCCGTC GCCGCCGAGG 950 

CCGGTCAGCG CTTCAAC 967 

55 

2) INFORMATION FOR SEQ ID NO: 906 

60 (i) SEQUENCE CHARACTERISTICS: 

509 
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PCT/CAOO/01150 



(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolar is hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906 

TGGTGCTATT ATTGTTGTTG CCGCTTCCGA CGGTCAAATG CCCCAGACTC 50 

GTGAGCATCT GCTGCTCGCC CGTCAGGTCG, GTGTTCAGAA GATCGTTGTC 100 

TTCGTTAACA AGGTCGACGC TGTCGAGGAC AAGGAGATGT TGGAGCTTGT 150 

CGAGATGGAG ATGCGCGAAC TGCTCAGCAG CTATGGCTTC GAGGGCGACG 200 

AGACCCCTAT CATCATGGGT TCTGCTCTCT GCGCCATTGA AGGCCGTCAA 250 

CCCGACATTG GTGTCGAACG AATTGACGAG CTGCTCGAGG CTGTTGATAC 3 00 

TTGGATTCCC ACCCCTCAGC GTGAGACCGA AAAGCCTTTC CTCATGGCCG 350 

TCGAGGATGT CTTCTCCATT GCTGGTCGTG GTACCGTCGT CTCTGGCCGT 400 
GTCGAACGAG GTATCCTGAA GCGCGATGCT GAAGTTGAGC TTGTGGGCAA ' 450 

GGGCAGCGCA CCCATCAAGA CCAAGGTTAC CGATATCGAG ACCTTCAAGA 500 

AGTCTTGCGA GGAGTCCCGC GCTGGTGACA ACTCCGGTCT CCTTCTTCGT 550 

GGTGTTAAGC GTGATGAAGT CCGCCGTGGT ATGGTCGTTT CCGTCCCTGG 600 

ACAGGTTAAG GCGCACAAGA AGTTCCTTGT CTCCATGTAT GTGCTGAGCA 650 

AGGAGGAAGG TGGCCGACAC ACTGGCTTCG GTGAGAACTA CAGGCCGCAA 700 

ATGTTCATCC GCACTGCCGA CGAGTCGTGT GCCCTGTACT GGCCAGAAGG 750 

CACCGAGGAT GCCCACGACA AGCTTGTCAT GCCCGGTGAC AACGTCGAGA 800 

TGGTTTGCGA GCTCCATGCA CCACACGTCT TGGAGACTGG TCAGCGCTTC 850 

AA 852 



2) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 

GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 200 

TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 300 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 350 

AGGCCGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 4 00 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

510 
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AGATCATCGG AGGTAGCTTC GATGCTACCA 
GAGACCTTCA AGAAGTCTTG TGACGAGTCC 
CTTGCTTCTC CGTGGTATCC GTCGTGAAGA 
• TTGCTGCTCC TGGCAGCACC AAGGCCCACG 
5 TACGTTCTCA CTGAGGCTGA GGGTGGTCGT 
CTACCGCCCC CAGGTGTTCG TTCGCACTGC 
TCACTTAACG GTATTGATTA AGTTCTAACT 
GCTGACCTCA GCTTCCCCGA CGGTGATGAG 
TGACAACGTC GAGATGGTCC TCAAGACTCA 
10 CTGGCCAGCG CTTCAA 



PCT/CA00/01150 

AGACCAAGGT CACCGACATT* 550 

CGCGCTGGTG ACAACTCTGG 600 

CGTCCGCCGC GGAATGATCA 650 

ACCAGTTCTT GGTGTCCATG 700 

CGTACTGGCT TCGGCTCCAA 750 

TGGTAAGTCA AGCCTTTTGC 800 

GTTGTATCCT AGATGAGGCT 850 

TCCCGGAGGG TGATGCCTGG 900 

CCGCCCCATT GCTGCTGAGG 950 

966 



2) INFORMATION FOR SEQ ID NO:. 908 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908 

30 GGTGCTATCA TCGTCGTTGC TGCTTCCGAT GGTCAGATGC CCCAGACCCG 50 

TGAGCACTTG CTGCTCGCCC GTCAGGTCGG TGTTCAGAAG ATCGTTGTCT 100 

TCGTCAACAA GGTCGATGCT GTCGAAGACC CGGAGATGTT GGAACTCGTC 150 

GAGATGGAGA TGCGTGAGTT ACTCACCAGC TACGGCTTCG AGGGCGACGA 200 

GACACCCATC ATCATGGGTT CCGCTCTATG CGCCATCGAG GGCCGCCAGC 250 

35 CCGAGATCGG TGTTACCAAG GTCGACGAGC TAATGGACGC TGTCGACTCA 300 

TGGATCCCCA CCCCTCAGCG TGAGACCGAG AAGCCTTTCC TCATGGCTGT 350 

TGAGGATGTC TTCTCGATTG CTGGACGTGG TACCGTCGTT TCGGGCCGTG 400 

TCGAGCGCGG TATCTTGAAG CGTGACGCTG AAGTCGAGCT TGTCGGCAAG 450 

GGCACCGCGC CAATCAAGAC CAAGGTCACT GATATTGAGA CCTTCAAGAA 500 

40 GTCGTGCGAG GAGTCGCGCG CGGGTGATAA CTCCGGTCTT CTCCTCCGTG 550 
GTGTCAAGCG TGATGACGTT CGCCGCGGTA TGGTTGTTTC CGTTCCCGGA . 600 

CAAGTCAAGG CTCACAAGAA GTTCCTTGTC TCCATGTACG TTCTAAGCAA 650 

AGAGGAGGGT GGTCGTCACA CCGGCTTCGG CGAGAACTAC AGGCCGCAAA 700 

TGTTCATCCG AACTGCCGAT GAATCCTGCG CACTTCACTT CCCAGAGGGT 750 

4 5 ACCGAGGATG CGCACGACAA GCTAGTTATG CCCGGTGACA ACGTCGAGAT 800 

GGTCTGCGAA CTCCACCAGC CCCACGTTCT AGAGACCGGT CAGCG 845 



50 2) INFORMATION FOR SEQ ID NO: 909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

511 
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(A) ORGANISM: Penicilliun 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG 
AGCTGCCAAT TATGGACACT GCTGATAAGA 
CACTTGCTCC TCGCCCGTCA GGTCGGTGTT 
CAACAAGGTT GATGCCGTCG AGGACCCCGA 
TGGAAATGCG TGAACTCTTG ACCACCTACG 
CCTATCATTT TCGGATCCGC TCTTTGCGCC 
GATTGGCGAA CAGAAGATTG ACGAGCTCAT 
TCCCCACCCC CCAGCGTGAC CTTGACAAGC 
GAAGTTTTCT CCATCTCTGG TCGTGGTACC 
GCGTGGTATT TTGCGCAAGG ATTCTGAGGT, 
AGAAGCCTAT CAAGACCAAG GTTACCGACA 
TGCGATGAAT CTCGTGCTGG TGACAACTCT 
CAAGCGTGAG GACATTCGTC GTGGTATGGT 
CCAAGGCTCA TGACAACTTC TTGGTCTCCA 
GAAGGTGGTC GTCGTACTGG ATTCGGCGCC 
CATCCGTACT GCCGGTATGT TCCCTTTCAA 
GCTAACGAGT TATAGATGAG GCTGCTACTC 
CAGTCCAAGC AGGTCATGCC CGGTGACAAC 
ACACCGTCCC GTTGCCGCCG AAGCTGGTCA 



PCT/CAOO/01150 
marneffei 1 



ID NO: 909 

TCAAATGTAA CATATCCACG 50 

ATAGGCCCCA AACCCGTGAG 100 

CAAAAGATCG TCGTCTTCGT 150 

GATGTTGGAA CTTGTCGAAT 200 

GTTTCGAGGG TGAAGAGACC 250 

TTGGAAGGCC GCAAGCCCGA 300 

GAACGCCGTT GATACCTGGA 350 

CCTTCTTGAT GTCCGTTGAG 4 00 

GTTGCATCTG GTCGTGTTGA 450 

TGAGATTATC GGATACCAGA 500 

TTGAGACCTT CAAGAAGTCT 550 

GGCTTGCTTC TCCGTGGTAT 600 

TATCGCTGCT CCTGGAACCA 650 

TGTATGTCTT GACTGAGGCT 700 

AACTACCGTC CTCAAGCTTT 750 

AGTCAATTAA TGAGCGATTT 800 

TCAGCTTCCC CGGTGACGAT 850 

GTTGAGATGA TCTTGAAGAC 900 

G 931 



2) INFORMATION FOR SEQ ID NO: 910 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

512 
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900- 
931 



2) INFORMATION FOR SEQ ID NO: 911 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 2 



2) INFORMATION FOR SEQ ID NO: 912 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 2 



2) INFORMATION FOR SEQ ID NO: 913 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913 
GACGGYSYCA TGCCKCAGAC 2 



2) INFORMATION FOR SEQ ID NO: 914 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914 
GAARAGCTGC GGRCGRTAGT G 



2) INFORMATION FOR SEQ ID NO: 915 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915 
AAACGCCTGA GGRCGGTAGT T 



2) INFORMATION FOR SEQ ID NO: 916 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916 
GCCGAGCTGG CCGGCTTCAG 



2) INFORMATION FOR SEQ ID NO: 917 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 



2) INFORMATION FOR SEQ ID NO: 918 

(i) SEQUENCE CHARACTERISTICS: 

5U 
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(A) LENGTH: 1391 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(E) ACCESSION NUMBER: J01672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918 

AGAGAAGCCT GTCGGCACCG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 50 

AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 100 

GTCAGGCTAC TGCGTATGCA TTGCAGACCT TGTGGCAACA ATTTCTACAA 150 

AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 200 

TTGACTATCC GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 250 

GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 300 

ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 350 

TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 4 00 

GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 450 

CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 500 

GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 550 

GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 600 

CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 650 

CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 700 

GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 750 

GATGAGCCAG GCGATGCGTA AGCTGGCGGG TAACCTGAAG CAGTCCAACA 800 

CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 850 

GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 900 

TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 950 

TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 1000 

CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GTATCAACTT 1050 

CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 1100 

CAGGCGCGTG GTACAGCTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 1150 

AATGCGACTG CCTGGCTGAA AGATAACCCG GAAACCGCGA AAGAGATCGA 1200 

GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 1250 

TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAACTAACGA AGATTTTTAA 1300 

TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 1350 

AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 1391 



2) INFORMATION FOR SEQ ID NO: 919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 
GGICCIGART CITMIGGIAA RAC 



515 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920 
TCICCVATIT CICCITCIAI YTC 



2) INFORMATION FOR SEQ ID NO: 921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921 
TIYRTIGAYG CIGARCAIGC 



2) INFORMATION FOR SEQ ID NO: 922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
TARAAYTTIA RIGCIYKICC ICC 



2) INFORMATION FOR SEQ ID NO: 923 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 

516 
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GACGCIGCCA TCCTGATGAT C 
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21 



5 2) INFORMATION FOR SEQ ID NO: 924 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 21 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 925 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925 
AAGCAGATGG TTGTGTGCTG ■ ~ 20 

2) INFORMATION FOR SEQ ID NO: 926 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926 
CAGCTGCTCG TGGTGCATCT CGAT 24 

50. 



2) INFORMATION FOR SEQ ID NO: 927 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 

517 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927 
5 ACGCGGAGAA GGTGCGCTT 19 



2) INFORMATION FOR SEQ ID NO: 928 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928 

20 

GGTCGTTCTT CGAGTCACCG CA 22 



25 2) INFORMATION FOR SEQ ID NO: 929 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: ATCC 25285 



(xi) SEQUENCE DESCRIPTION: SEQ 

TTCAGCATGC CATTTCAAAA CAGGCCGAAG 
GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG 
GGAATTGGTG GACCCGCACA CGGGACGTAA 
TTATCGCAAA TACATCGAAC ATGCCGGTAG 
TATACGGCCA TGACGATTGC CGAATACTAT 
CCTGCTGATG GCAGACTCCA CTTCCCGTTG 
TGTCGAACCG TATGGAGGAG TTGCCCGGAC 
CTGTCCTCAA TCATTTCTAA CTTCTATGGC 
GAATAACGGC GAGAGCGGTT CTATTACCTT 



ID NO: 929 

CCGATATCGT GATTATCGCT 50 

GAAATCTTTA CCGAATTTCC 100 

GCTGATGGAG CGTACCATTA 150 

CAGCGCGTGA AGCTTCTGTG 200 

CGTGCCATGG GATTGAAAGT 250 

GGCGCAGGCA TTGCGTGAGA 300 

CGGATGCATT CCCGATGGAC 350 

CGTGCAGGCT ACGTGAAACT 4 00 

TATCGGTACA GTATCACC 448 



2) INFORMATION FOR SEQ ID NO: 930 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 

(B) STRAIN: ATCC 8503 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930 



10 



15 



GCTATCTCTA 
TGAGCGTGCG 
TAGACCCGCA 
AATACATCCA 
GATGACCATC 
TGGCCGACTC 
CGTTTGGAGG 
TATCGTGGCG 
ACGCTACAGG 



AACAAGCGGA 
AATGAGGTCG 
TACGGGACGT 
ACATGCCGGT 
GCCGAGTATT 
TACTTCCCGC 
AGTTGCCGGG 
AACTTCTACG 
CTCCGTCACT 



AGCGGATATC 
TAGAGGTATT 
AAATTGATGG 
AGCCGCTCGT 
ATCGCAGCAT 
TGGGCACAGG 
ACCGGATGCT 
CTCGTGCGGG 
TTCATCGGTA 



GTGATTATCG 
TACGGAGTTC 
AACGTACGAT 
GAGGCATCCG 
GGGTTTGAAG 
CTTTGCGTGA 
TTCCCGATGG 
ATTCGTTCAT 
CGGTATCG 



CCGCCTGCGG 50 

CCGGAATTGG 100 

CATTATCGCC 150 

TATATACGGC 200 

GTTCTGTTGA 250 

GATGTCCAAC 3 00 

ACTTGTCCGC 350 

TTGAATAACA 4 00 
438 



20 



2) INFORMATION FOR SEQ ID NO: 931 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 53 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931 



CCTCCAGCAC 
CAGCCTGCGG 
CCTGAGCTCG 

40 CATCATCGCT 
TCTACACCGC 
GTACTCCTAA 
GATGTCTAAC 
ACTTGTCGGC 

4 5 CTCAACAACG 
AGC 



GCTATCTCTA 
TGAGCGTGCT 
AAGACCCACA 
AACACGAGTA 
TATGACCATC 
TGGCTGACTC 
CGTCTAGAGG 
TATCGTGGCA 
GTGAGACAGG 



AGCAGGCGGA 
AATGAGGTGG 
CACGGGACGC 
ACATGCCAGT 
GCTGAGTACT 
GACCTCTCGC 
AGCTGCCTGG 
AACTTCTACG 
TTCTGTAACC 



GGCTGATATC 
TGGAGATCTT 
AAGCTGATGG 
GGCTGCTCGT 
ACCGCTCGAT 
TGGGCACAGG 
ACCAGATGCA 
CTCGTGCCGG 
TTCATCGGTA 



GTCATTATGG 50 

TGCCGAGTTC 100 

AGCGTACGAT 150 

GAGGCTTCGG 200 

GGGTCTCAAA 250 

CACTGCGTGA 300 

TTCCCGATGG 350 

CTTCGTCTAT 400 

CGGTCTCTCC 450 
453 



50 2) INFORMATION FOR SEQ ID NO: 932 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM : Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 

5 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

GTGAACATAT CTTACTTTCA CGTCAAGTTG GTGTTCCATA CATCGTTGTA 100 

TTCATGAACA AATGTGACAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA ATTCGTGATC TATTAACTGA ATATGAATTC CCTGGCGATG 200 

10 ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA AGGTGAAGCT 250 

GACTGGGAAG CTAAAATTGA CGAGTTAATG GAAGCTGTAG ATTCTTACAT 300 

TCCAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ATGTATTCTC AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA 4 00 

CGTGGACAAG TTAAAGTTGG TGACGAAGTA GAAGTTATCG GTATCGAAGA 450 

15 AGAAAGCAAA AAAGTAGTAG TAACTGGAGT, AGAAATGTTC CGTAAATTAC 500 

TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT ACGTGGTGTT 550 

GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 600 

TACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT 700 

20 TTCCGTACTA CTGACGTAAC TGGTATTGTT ACACTTCCAG AAGGTACTGA 750 

AATGGTAAYG CCTGGTGATA ACATTGAGCT TGCAGTTGAA CTAATTGCAC 800 

CAATCGCTAT CGAAGACGGT ACTAAATTCT CTATC 835 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 24 

2) INFORMATION FOR SEQ ID NO: 934 



(i) SEQUENCE CHARACTERISTICS: 
4 5 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG I AG A 24 

55 

2) INFORMATION FOR SEQ ID NO: 935 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935 
GGIAARWSIC ARYTITGYCA YAC 



2) INFORMATION FOR SEQ ID NO: 936 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936 
TCISIYTCIG GIARRCAIGG 



2) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 7 
ATIACIGARG YITTYGGIGA RTT 



2) INFORMATION FOR SEQ ID NO: 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938 
CYIGTIGYIS WIGCRTGIGC 
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10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: D10023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTACCAGC AGACCTTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

20 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGGAAGCGC AAGGTGAAAT 200 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 250 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAACTAAGG 300 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 350 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 400 

2 5 ACGAAGCGGC AAGGCTAGTG CCTATGGGAT TTGTCACGGC TGCTGATTTT 450 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 500 

GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT ACTGAGCTTT 550 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

ACATGCCAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 650 

30 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 700 

AGCGGTTCGG ATTAGACCCG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 800 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTCGAT TCTGTTATGG 850 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

35 ATGCATTTAG CCAAATTTAT GCGTGCTTTG CAAAGGCTGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 1050 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 1100 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 1150 

40 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 12 00 

TAG 12 03 



4 5 2) INFORMATION FOR SEQ ID NO: 94 0 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 

522 
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CGATCCAATT GCTGGTCTTA AGATGCATTT GATTGATCTA GGTATTGCC^ 50 

CTGAAGCTGA AGTCAAAGCT TACGACAAGT CCGCTAGAAA ATACGTTGAC 100 

GAACAAGTTG AATTAGCTGA TGCTGCTCCT CCTCCAGAAG CCAAATTATC 150 

5 CATCTTGTTT GAAGACGTCT ACGTGAAAGG TACAGAAACT CCAACCCTAA 200 

GAGGTAGGAT CCCTGAAGAT ACTTGGGACT TCAAAAAGCA AGGTT TTGCC 250 

TCTAGGGATT AATTAAATCG TAAGGAAAAA TAAAATAATA GTGCTGTGAT 3 00 

CGCATGATAT TCTTCCCTGG AAGCGCCATT TTATAGCAAG AAATGTAAGT 350 

CAAGTATATT TTAACTGTAT ATACAACAAT ATGACTCTTT TTTATGCCTT 4 00 

10 GTTGTTTTTC TTCGGGTTTT CCCACACATT GTGTGGAGAG ATAGTTATTA 4 50 

ACAGACCGAA AATAGCCGCC CAAGGATAAA CTTTTATATA AAGGGAAGGG 500 

TAGTTGACCC AAAAATTTGG ATTCTACTTT CCAGATTTAC TTTCACCCTT 550 

TTATATTTGC TGTAGTCTGT TATGCCAATC AGGAAAGCAT TTGAACAAAT 600 

ATGTCTGTTA CAGGAACTGA GATCGATAGT GATACAGCAA AAAATATTCT 650 

15 TAGTGTAGAT GAATTACAGA ACTATGGTAT, TAATGCCTCA GATCTTCAAA 700 

AATTGAAGTC TGGTGGGATA TACACAGTCA ATGTATGTTA TAATAACATT 750 

TTTAAAACCT CTGCTGTAGA GGTTCTTTCC CCCTTTCTTT TACTAACTAA 800 

TAATTTGGAA AGGAACTTTT ATAGACCGTT TTGTCAACAA CAAGAAGACA 850 

TCTATGTAAA ATTAAAGGGT TAAGTGAGGT GAAAGTGGAA AAAATTAAAG 900 

20 AAGCTGCTGG AAAAATCATA CAGGTGGGAT TTATCCCCGC TACTGTACAA 950 

TTAGATATAA GACAGCGTGT GTACTCCTTA TCAACTGGAT CTAAGCAACT 1000 

AGATTCAATC CTAGGTGGTG GAATAATGAC AATGAGTATC ACTGAAGTAT 1050 

TTGGTGAATT TAGGTGTGGT AAGACACAGA TGTCTCATAC TTTGTGTGTT 1100 

ACCACGCAGC TTCCGAGGGA AATGGGTGGT GGTGAAGGGA AAGTAGCATA 1150 

25 TATTGATACA GAAGGCACTT TCAGGCCCGA GAGGATTAAG CAAATTGCAG 1200 

AAGGTTATGA ATTGGATCCC GAGTCATGTT TGGCAAACGT TTCATATGCT 1250 

AGAGCCTTGA ATAGTGAACA TCAAATGGAA CTTGTTGAAC AATTGGGTGA 1300 

AGAACTTAGT TCTGGAGATT ATCGCCTTAT CGTGGTAGAT TCTATAATGG 1350 

CAAACTTCAG AGTAGACTAC TGCGGTAGAG GTGAACTAAG CGAAAGACAG 14 00 

30 CAAAAGCTAA ATCAACATCT TTTCAAATTG AATAGATTGG CAGAGGAATT 1450 

TAATGTTGCA GTATTTCTGA CAAACCAAGT TCAATCAGAC CCAGGTGCTT 1500 

CTGCATTATT TGCCTCGGCA GATGGTAGGA AACCAATTGG AGGGCACGTT 1550 

CTGGCACATG CGTCAGCAAC AAGGATTTTG TTGAGAAAAG GGCGTGGTGA 1600 

CGAAAGAGTT GCCAAGTTAC AAGATTCCCC AGATATGCCT GAAAAAGAAT 1650 

35 GTGTCTACGT AATTGGTGAA AAAGGTATTA CCGATTCAAG TGACTAGTTT 1700 

TTGTATACTT TTTTAATGAA GATGACATTG CTCCTTTATT AAACTTTTCT 1750 

TTTACTTTGT GTTACTAATA TTATTAATAT CTTGTTATGA TTCTTTGTTT 1800 
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15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus humicolus 

(B) STRAIN: ATCC 38294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 
TCTCCGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 
TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 
GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 
GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 
GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 
CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 
ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 
ACCACCAAGA AGGGTTCGAT TACCTCCGTC 



50 
100 
150 
200 
250 
300 
350 
400 
430 



30 



35 



2) INFORMATION FOR SEQ ID NO: 942 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 



45 CGTGTACGAT 
AAGTTCAGCA 
TCCTCCGACG 
GATCGAAGTC 
TGGGTGAACC 

50 GCGATTCACC 
ACTGCTGGAA 
AGGGCGGTAA 
AACATGATGG 
TGTGTTTGCG 

55 ACGAAATGAC 
CAGATGAACG 
GACCATGGCT 
TTGACAACAT 
CTGGGCCGTA 

60 GATGGGCGTT 



GCTCTTGAGG 
GCAGCTCGGC 
GTCTGCGTCG 
CCGGTAGGTA 
GGTCGACATG 
GCGCAGCACC 
ACCGGTATCA 
AGTTGGTCTG 
AGCTCATTCG 
GGCGTAGGTG 
CGACTCCAAC 
AGCCGCCGGG 
GAGAAATTCC 
CTATCGTTAC 
TGCCTTCAGC 
CTGCAGGAAC 



TGCAAAATGG 
GGCGGTATCG 
CGGTCTGGAT 
AAGCGACTCT 
AAAGGCGAGA 
TTCCTACGAA 
AAGTTATCGA 
TTCGGTGGTG 
TAACATCGCG 
AACGTACTCG 
GTTATCGATA 
AAACCGTCTG 
GTGACGAAGG 
ACCCTGGCCG 
GGTAGGTTAT 
GTATCACCTC 



TAATGAGCGT 
TGCGTACCAT 
GTAAAAGACC 
GGGCCGTATC 
TCGGTGAAGA 
GAGCTGTCAA 
CCTGATGTGT 
CGGGTGTAGG 
ATCGAGCACT 
TGAGGGGAAC 
AAGTATCCCT 
CGCGTAGCTC 
TCGTGACGTT 
GTACGGAAGT 
CAGCCGACCC 
CACCAAAACC 



CTGGTGCTGG 50 

CGCAATGGGT 100 

TCGAACACCC 150 

ATGAACGTAC 200 

AGAGCGTTGG 250 

ACTCTCAGGA 300 

CCGTTCGCTA 350 

TAAAACCGTA 400 

CCGGTTACTC 450 

GACTTCTACC 500 

GGTGTATGGC 550 

TGACCGGTCT 600 

CTGCTGTTCG 650 

ATCCGCACTG 700 

TGGCGGAAGA 750 

GGTT 794 
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2) INFORMATION FOR SEQ ID NO: 943 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 3 



20 ATGCCGTACC 
CGTCTGGTGC 
CATCGCAATG 
ACCTCGAACA 
ATCATGAACG 

25 AGAAGAGCGT 
CAAACTCTCA 
TGTCCGTTCG 
AGGTAAAACC 
ACTCCGGTTA 

30 AACGACTTCT 
CCTGGTGTAT 
CTCTGACCGG 
GTTCTGCTGT 
AGTATCCGCA 

35 CCCTGGCGGA 
ACTGGTTCTA 



GCGCGTGTAC 
TGGAAGTTCA 
GGTTCCTCCG 
CCCGATTGAA 
TACTGGGTGA 
TGGGCGATTC 
GGAACTGCTG 
CTAAGGGCGG 
GTAAACATGA 
CTCTGTGTTT 
ACCACGAAAT 
GGCCAGATGA 
TCTGACCATG 
TCGTTGACAA 
CTGCTGGGCC 
AGAGATGGGC 
TCAC 



GATGCTCTTG 
GCAGCAGCTC 
ACGGTCTGCG 
GTCCCGGTAG 
ACCGGTCGAC 
ACCGCGCAGC 
GAAACCGGTA 
TAAAGTTGGT 
TGGAGCTCAT 
GCGGGCGTAG 
GACCGACTCC 
ACGAGCCGCC 
GCTGAGAAAT 
CATCTATCGT 
GTATGCCTTC 
GTTCTGCAGG 



AGGTGCAAAA 
GGCGGCGGTA 
TCGCGGTCTG 
GTAAAGCGAC 
ATGAAAGGCG 
ACCTTCCTAC 
TCAAAGTTAT 
CTGTTCGGTG 
TCGTAACATC 
GTGAACGTAC 
AACGTTATCG 
GGGAAACCGT 
TCCGTGACGA 
TACACCCTGG 
AGCGGTAGGT 
AACGTATCAC 



TGGTAATGAG 
TCGTACGTAC 
GATGTAAAAG 
TCTGGGCCGT 
AGATCGGTGA 
GAAGAGCTGT 
CGACCTGATG 
GTGCGGGTGT 
GCGATCGAGC 
TCGTGAGGGT 
ACAAAGTATC 
CTGCGCGTTG 
AGGTCGTGAC 
CCGGTACGGA 
TATCAGCCGA 
CTCCACCAAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
814 



40 2) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGTACCATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 150 

ATCGAAGTCC CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 200 

60 GGGTGAACCG GTCGACATGA AAGGCGAGAT CGGTGAAGAA GAGCGTTGGG 250 
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CGATTCACCG 
CTGCTGGAAA 
GGGCGGTAAA 
ACATGATGGA 
GTGTTTGCGG 
CGAAATGACC 
AGATGAACGA 
ACCATGGCTG 
TGACAACATC 
TGGGCCGTAT 
ATGGGCGTTC 



CGCAGCACCT 
CCGGTATCAA 
GTTGGTCTGT 
GCTTATTCGT 
GCGTAGGTGA 
GACTCCAACG 
GCCGCCGGGA 
AGAAATTCCG 
TATCGTTACA 
GCCTTCAGCG 
TGCAGGAACG 



TCCTACGAAG 
AGTTATCGAC 
TCGGTGGTGC 
AACATCGCGA 
ACGTACTCGT 
TTATCGACAA 
AACCGTCTGC 
TGACGAAGGT 
CCCTGGCCGG 
GTAGGTTATC 
TATCACCTCC 



AGCTGTCAAA CTCTCAGGAA 
CTGATGTGTC CGTTCGCTAA 
GGGTGTAGGT AAAACCGTAA 
TCGAGCACTC CGGTTACTCT 
GAGGGTAACG ACTTCTACCA 
AGTATCCCTG GTGTATGGCC 
GCGTTGCTCT GACCGGTCTG 
CGTGACGTTC TGCTGTTCGT 
TACGGAAGTA TCCGCACTGC 
AGCCGACCCT GGCGGAAGAG 
ACCAAAACCG GTTCTATC 



30/0 
350 
400 
450 
500 
550 
600 
650 
700 
750 
798 
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20 



25 



30 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 200 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



2) INFORMATION FOR SEQ ID NO: 946 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 946 



GCGACGCTAT 
CTGACTCTGG 
TGCAATGGGT 
CTGGTGCGCC 
GTCGACGTAT 
CAAGAGCCGT 
CAACTACCGA 
CCGTTTGCTA 
CAAAACCGTG 
GCGGTCTGTC 
GACTTCTACC 
GGTTTACGGT 
TGACCGGTTT 
GGTAAAGGTC 
TCTGGCCGGT 
TGGGTTACCA 
ATTACCTCTA 



CCCGCATGTT 
AAGTTCAACA 
AGTTCAGACG 
AATCACTGTG 
TGGGTACGCC 
GCCATTCACC 
ATTGTTGGAA 
AAGGCGGTAA 
AACATGATGG 
CGTGTTCGCA 
ACGAGATGAA 
CAGATGAACG 
GACCATGGCG 
GCGACGTATT 
ACCGAAGTAT 
ACCGACATTG 
CCCAAACCGG 



TACGATGCCC 
ACTTCTGGGT 
GCCTGAAACG 
CCGGTAGGTA 
TGTTGATGAA 
AAACTGCTCC 
ACCGGTATTA 
AGTAGGTCTG 
AATTGATCAA 
GGTGTGGGCG 
AGATTCCAAC 
AACCTCCGGG 
GAATACTTCC 
GTTCTTCGTT 
CTGCACTGTT 
GCTGAAGAAA 
TTCCATTACT 



TGAAATTGGA CGAGAACGGT 
GACGGCGTTG TCCGTACTAT 
CGGCATGTCT GTAAGCAATA 
AAGGTACTTT GGGTCGTATT 
GCAGGTCCGA TCGATACCGA 
GAAATTCGAC GAGTTGTCTG 
AAGTGATCGA CTTGCTGTGT 
TTCGGTGGTG CCGGTGTAGG 
CAACATCGCC AAAGCGCACA 
AGCGTACCCG TGAAGGTAAC 
GTATTGGATA AAGTGGCAAT 
CAACCGTTTG CGCGTCGCAT 
GTGACGAAAA AGACGAAAAC 
GACAACATCT ACCGTTACAC 
GGGCCGTATG CCTTCTGCAG 
TGGGTCGTTT GCAAGAGCGT 
TC 
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2) INFORMATION FOR SEQ ID NO: 947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 7 

TCCGCGCGAT GCCATTCCGC ATGTTTACGA CGCCCTGAAA TTGGATGCAA 50 

ACGGCCTGAC TTTGGAAGTA CAACAGCTTC TGGGCGACGG CGTGGTTCGT 100 

ACTATTGCAA TGGGTAGTTC GGACGGTCTG AAACGCGGCA TGACTGTAAG 150 

CAATACAGAT GCGCCGATTA CTGTGCCGGT AGGTAAAGGT ACTTTGGGAC 200 

GTATTGTCGA TGTGTTGGGT ACACCTGTTG ATGAAGCAGG TCCGATTGAT 250 

ACCGACAAAC ACCGTGCTAT CCATCAGACA GCTCCGAAAT TCGATGAGTT 300 

GTCTGCTACT ACCGAGCTGC TGGAAACAGG CATTAAAGTG ATTGACTTGC 350 

TGTGTCCGTT TGCCAAAGGC GGTAAAGTAG GTCTGTTCGG TGGTGCCGGT 4 00 

GTAGGCAAAA CCGTCAACAT GATGGAATTG ATTAACAACA TCGCCAAAGC 4 50 

GCATAGTGGT TTGTCCGTGT TCGCCGGTGT GGGGGAACGT ACCCGTGAAG 500 

GTAACGACTT CTACCACGAG ATGAAAGATT CCAACGTATT GGACAAAGTG 550 

GCGATGGTTT ACGGTCAGAT GAACGAACCT CCGGGTAACC GTCTGCGTGT 600 

AGCCTTGACC GGTTTGACGA TGGCCGAATA CTTCCGTGAT GAAAAAGACG 650 

AAAGCGGCAA AGGTCGCGAC GTATTGTTCT TCGTGGACAA CATTTACCGT 700 

TACACTCTGG CCGGTACAGA AGTATCCGCA TTGCTCGGTC GTATGCCTTC 750 

AGCAGTAGGT TACCAACCGA CATTGGCTGA AGAAATGGGT CGTCTGCAAG 800 

AGCGTATTAC CCTCTACTCA AACAGGCTCC ATTACTTCTA 840 
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2) INFORMATION FOR SEQ ID NO: 948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
15 (B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 8 

GCAGCTGGCG ACAAGCTACC TGAGATCAAT AATGCACTTG TAGTCTATAA 50 

20 AAATGACGAA AAAAAATCAA AAATCGTCCT TGAAGTAGCT CTTGAGCTTG 100 

GTGATGGAGT GGTTCGGACC ATCGCTATGG AATCAACGGA TGGGTTGACT 150 

CGTGGCATGG AAGTGCTAGA TACTGGCCGT CCAATTTCTG TGCCAGTCGG 2 00 

CAAAGAAACA CTTGGTCGCG TCTTTAACGT TTTGGGAGAT ACCATTGACT 250 

TGGATGCTCC TTTTGCGGAT GATGCAGAGC GCCAGCCAAT CCATAAGAAA 3 00 

25 GCTCCAACCT TTGATGAGTT GTCTACTTCT TCAGAGATCT TAGAGACAGG 350 

TATCAAGGTT ATCGACCTGT TAGCCCCTTA TCTGAAAGGT GGTAAAGTTG 400 

GACTCTTCGG TGGTGCCGGA GTTGGTAAGA CCGTCCTGAT TCAAGAATTG 4 50 

ATCCACAACA TTGCCCAAGA ACACGGTGGT ATTTCTGTAT TTACTGGCGT 500 

TGGGGAACGT ACCCGTGAAG GGAATGACCT TTATTGGGAA ATGAAAGAGT 550 

3 0 CTGGTGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAGAT GAATGAGCCG 600 

CCAGGAGCGC GTATGCGGGT TGCTTTGACT GGTTTGACGA TTGCAGAATA 650 

CTTCCGTGAT GTGGAAGGTC AAGATGTCTT GCTCTTCATT GACAACATCT 700 

TCCGTTTCAC GCAGGCAGGT TCTGAAGTTT CTGCCCTTTT GGGTCGGATG 750 

CCGTCAGCCG TTGGTTACCA ACCAACACTT GCGACAGAAA TGGGGCAATT 800 

35 GCAAGAGCGT ATCACATCGA CTAAGAAGGG TTCTGTAACC TCT 843 



2) INFORMATION FOR SEQ ID NO: 94 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCCAACTT TTGATGAATT GTCTACCTCT 
GATTAAGGTT ATCGACCTTC TTGCCCCTTA 
GACTTTTCGG TGGTGCCGGA GTTGGTAAAA 
ATTCACAACA TTGCCCAAGA ACACGGTGGT 
TGGGGAACGT ACTCGTGAGG GTAATGACCT 
CAGGTGTTAT CGAGAAAACA GCCATGGTAT 
CCAGGAGCAC GTATGCGTGT TGCCCTAACT 
CTTCCGTGAT GTGGAAGGCC AAGACGTGCT 
TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT 
CCATCAGCCG TTGGTTACCA ACCAACACTT 
GCAAGAGCGT ATTACATCAA CTAAAAAGGG 



PCT/CAOO/01150 

TCTGAAATCC TTGAAACAGG 350 

CCTTAAAGGT GGTAAGGTTG 4 00 

CCGTCTTAAT CCAAGAATTG 4 50 

ATTTCAGTAT TTACCGGTGT 500 

TTACTGGGAA ATGAAAGAAT 550 

TTGGTCAGAT GAATGAGCCG 600 

GGTTTGACAA TCGCCGAATA 650 

TCTCTTTATC GATAATATCT 700 

CTGCCCTTTT GGGTCGTATG 750 

GCTACGGAAA TGGGTCAATT 800 

TTCTGTAACC T 841 



2) INFORMATION FOR SEQ ID NO: 950, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950 

GCTACCTGAG ATCAATAATG CACTTGTAGT CTATAAAAAT GACGAAAATA 50 

AATCAAAAAT CGTCCTTGAA GTAGCTCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 200 

GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 300 

TGAGTTGTCT ACTTCATCAG AGATCTTAGA GACAGGTATC AAGGTTATCG 350 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 4 00 

GCCGGAGTTG GTAAGACCGT CCTGATTCAG GAATTGATCC ACAACATTGC 450 

CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG GAACGTACCC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCCACCAG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 750 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATCGACTAA GAAAGGTTCT GTAACTT 827 



2) INFORMATION FOR SEQ ID NO: 951 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951 
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GCAGCAGGGG 
AAATGACGAA 
GTGATGGTAT 

10 CGTGGAATGG 
TAAAGAAACT 
TGGATGCTCC 
GCTCCAACTT 
GATTAAGGTT 

15 GACTCTTCGG 
ATTCACAACA 
TGGAGAACGT 
CAGGCGTTAT 
CCTGGAGCAC 

20 CTTCCGTGAT 
TCCGTTTCAC 
CCTTCAGCCG 
GCAAGAACGT 



AAACACTTCC 
AGAAAAACAA 
GGTCCGTACG 
AAGTTTTGGA 
TTGGGACGTG 
TTTCGCTGAA 
TTGATGAATT 
ATCGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACCCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTAGAAGGCC 
TCAAGCTGGT 
TTGGTTACCA 
ATCACATCAA 



TGAGATTAAT 
AAATCGTCCT 
ATCGCCATGG 
CACAGGCCGT 
TCTTCAACGT 
GACGCTGAGC 
GTCTACCTCA 
TTGCCCCTTA 
GTTGGTAAAA 
ACATGGTGGT 
GGAACGACCT 
GCCATGGTAT 
TGCTCTTACT 
AAGATGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CTAAGAAGGG 



ID NO: 951 






AATGCACTTG 


TCGTCTACAA 


50 


TGAAGTAGCC 


TTGGAGTTGG 


100 


AATCAACAGA 


TGGTTTGACT 


150 


CCAATCTCTG 


TACCAGTAGG 


200 


TTTGGGAGAT 


ACTATTGACT 


250 


GTCAGCCAAT 


TCATAAGAAA 


300 


TCTGAAATCT 


TGGAAACAGG 


350 


CCTTAAAGGT 


GGGAAAGTTG 

****** t*> ^ 


400 




CCAAGAGTTG 


450 


ATTTCAGTAT 


TTACCGGTGT 


500 


TTACTGGGAA 


ATGAAAGAAT 


550 


TTGGTCAGAT 


GAATGAGCCA 


600 


GGTTTGACAA 


TCGCCGAATA 


650 


TCTCTTTATC 


GACAATATCT 


700 


CTGCCCTTTT 


GGGTCGGATG 


750 


GCTACGGAAA 


TGGGTCAATT 


800 


TTCTGTAACC 


TCTA 


844 



25 



2) INFORMATION FOR SEQ ID NO 
(i) 
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SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 



GCAGCAGGGG 
AAATGACGAA 
GAGATGGTAT 
CGTGGAATGG 
TAAAGAAACT 
TGGAAGCTCC 
GCTCCAACTT 
GATCAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGCGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



AAAAACTTCC 
AGAAAAACAA 
GGTTCGTACT 
AAGTATTGGA 
TTGGGACGTG 
TTTTACAGAA 
TTGATGAGTT 
ATTGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATCACATCAA 



TGAGATTAAC 
AAATCGTCCT 
ATCGCCATGG 
CACAGGTCGT 
TCTTCAACGT 
GACGCAGAGC 
GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CCAAGAAGGG 



AATGCACTTG 
TGAAGTAGCC 
AATCAACAGA 
CCAATCTCTG 
TTTGGGAGAT 
GTCAGCCAAT 
TCTGAAATCC 
CCTTAAAGGT 
CCGTCTTAAT 
ATTTCAGTAT 
TTACTGGGAA 
TTGGTCAGAT 
GGTTTGACAA 
TCTCTTTATC 
CTGCCCTTTT 
GCTACGGAAA 



TCGTCTACAA 50 

TTGGAGTTAG 100 

TGGGTTGACT 150 

TACCAGTAGG 2 00 

ACCATTGATT 250 

TCATAAAAAA 300 

TTGAAACAGG 350 

GGTAAAGTTG 4 00 

CCAAGAATTG 4 50 

TTACTGGTGT 500 

ATGAAAGAAT 550 

GAATGAGCCA 600 

TCGCTGAATA 650 

GATAATATCT 700 

GGGTCGTATG 750 

TGGGTCAATT 800 
830 
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2) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 4 00 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

ATCACATCAA CTAAGAAGGG TTC 823 



2) INFORMATION FOR SEQ ID NO: 954 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954 

GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 250 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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GCTCCAACTT 
GATCAAGGTT 
GACTTTTCGG 
ATTCACAACA 
TGGGGAACGT 
CAGGCGTTAT 
CCAGGAGCAC 
CTTCCGTGAT 
TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



TTGATGAGTT 
ATTGACCTTC 
TGGTGCCGGA 
TTGCCCAAGA 
ACTCGTGAGG 
CGAGAAAACA 
GTATGCGTGT 
GTGGAAGGCC 
TCAGGCTGGT 
TTGGTTACCA 
ATCACATCAA 



GTCTACCTCT 
TTGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTGCT 
TCAGAAGTAT 
ACCAACACTT 
CCAAGAAGGG 



TCTGAAATCC 
CCTTAAAGGT 
CTGTCTTAAT 
ATTTCAGTAT 
TTACTGGGAA 
TTGGTCAGAT 
GGTTTGACAA 
TCTCTTTATC 
CTGCCCTTTT 
GCTACGGAAA 
TTCTGTAACC 



TTGAAACAGG 
GGTAAAGTTG 
CCAAGAATTG 
TTGCTGGTGT 
ATGAAAGAAT 
GAATGAGCCA 
TCGCTGAATA 
GATAATATCT 
GGGTCGTATG 
TGGGTCAATT 
TCTA 



350 
400 
450 
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550 
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650 
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750 
800 
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2) INFORMATION FOR SEQ ID NO: 955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 2 50 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 3 00 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 4 00 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



2) INFORMATION FOR SEQ ID NO: 956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 
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(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 956 



TTGTATATCA 
ACGTGGGTTG 
TTCCCTGAAC 
ACGTACATGC 
AAGCTAGTAT 
GGTTACAACT 
TCTAAGGGAA 
ATCCGGCCTA 
GGGATAACTG 
ACACGAACCG 



CAGGCACTCA 
TGGTGAACGT 
TATCTACTGT 
TTAGTTGCCA 
ATACACTGGT 
GCACTCTTAT 
ATTTCTGGTA 
TTTATCGTCA 
TCTAATTAAT 
GATCTATTAC 



GCAAATATTC CGATACTGAC GTAATTATAT 
GGGAATGAAA TGGCTGAGAT TCTATGCGAA 
AGTTAATGAT GAAAAGGTGG CCATTATGGA 
ATACTTCTAA TATGCCAGTG GCCGCTAGAG 
ATTACAATTG CTGAATATTT CCGTGATATG 
GGCCGATTCC ACTAGCCGAT GGGCAGAGGC 
GATTGGCTGA AATGCCTGCA GATTCTGGCT 
AGGTTGTCAG CTTTTTATGA ACGTGCAGGT 
TTAGGCTTGA, TTAAGTGCTT AGGTTCACCA 
GGTTGTAGGA GCAGTTTCTC CACCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
495 



2) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-1MSS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957 

AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAACGA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTCTATTAA AGTAGGAGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 2 00 

AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 250 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 3 00 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 350 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 400 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

AGGAGCTGTT TCACCACCT 469 



2) INFORMATION FOR SEQ ID NO: 958 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 52 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacteriwn nucleatum subsp. polymorph 

533 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(B) STRAIN: ATCC 10953 



PCT/CAOO/01150 



(xi) SEQUENCE DESCR: 

5 TACAACACCA ACTTGCTAAA 
GGTTGTGGGG AACGTGGAAA 
AGAAATTATT GACCCTAAGA 
TTATAGCTAA TACTTCTAAT 
TATACTGGTA TAACTATTGC 
10 GGCACTTATG GCAGATTCAA 
TGTCAGGACG TTTGGAAGAA 
CTATCAAGTA GAATAGCAGA 
TCTAGGTAAT GGAGAAGAAG 
CA 

15 



PTION: SEQ ID NO: 958 

TGGGCAGATG CAGAAGTAGT 
TGAAATGACC GATGTACTTA 
CAGGACAATC TTTAATGAAG 
ATGCCAGTTG CTGCTCGTGA 
AGAATATTTT AGAGATATGG 
CAAGTCGTTG GGCAGAAGCA 
ATGCCAGGTG ATGAAGGATA 
GTTTTATGAA AGAGCAGGGC 
GAGCATTAAC TGTAATTGGA 



TGTTTATGTT 50 

TGGAATTCCC ■ 100 

AGAACAGTTC 150 

GGCTTCAATC 200 

GATATTCAGT 250 

CTTCGTGAAA 300 

TCCAGCATAT 350 

TTGTTGAATG 400 

GCAGTATCTC 450 
452 



2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

35 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 200 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

40 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 3 50 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 4 50 

CGGTGCCGTG TCTCCGCCG 469 



2) INFORMATION FOR SEQ ID NO: 960 



(i) SEQUENCE CHARACTERISTICS: 
50- (A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 3 0815 



534 
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10 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION; 



SEQ ID NO: 960 



TGTAATTAGT 
ATGTCGGCTG 
TTCCCGACCC 
GCGCACCTGC 
AGGCCTCTAT 
GGCAAGCATA 
GCTTCGTGAG 
ACCCTGCCTA 
CTCGTCACCT 
CGGTGCCGTG 



CAGGCCCTCT 
CGGCGAGCGC 
TGACGACCGT 
CTCGTGGCGA 
TTACACCGGC 
TCGCCATGAT 
ATTTCGGGTC 
TCTCAGCGCT 
GCATCGGCGG 
TCTCCGCCG 



CCAAGTACTC 
GGTAATGAGA 
GATCGATGGT 
ACACTTCGAA 
ATCACCCTGG 
GGCCGACTCG 
GTTTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CAACTCCGAC 
TGGCCGAGGT 
CGCGAGGAGT 
CATGCCAGTC 
CCGAGTACTA 
ACGTCTCGCT 
GATGCCGGCC 
CCTTCTACGA 
CAGGGCTCCG 
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TGCGTCATCT . 50 

GCTCATGGAG 100 

CGATCATGAA 150 

GCAGCCCGTG 2 00 

CCGTGATATG 250 

GGGCCGAGGC 300 

GATGGTGGCT 350 

GCGCGCCGGC 4 00 

TCACGATTGT 450 
469 



15 



20 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 961 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 



TGTCATCAGT 
ATGTCGGCTG 
TTCCCGACCC 
GCGCACCTGC 
AGGCCTCTAT 
GGCAAGCACA 
GCTGCGTGAG 
ACCCTGCCTA 
CTCGTCACCT 
CGGTGCAGTG 



CAGGCCCTCT 
CGGTGAACGC 
TGACGACTGT 
CTCGTGGCGA 
TTATACCGGC 
TTGCCATGAT 
ATTTCGGGTC 
CCTCAGCGCC 
GCATCGGCGG 
TCTCCACCG 



CCAAGTACTC 
GGTAACGAGA 
GATCGATGGT 
ACACCTCGAA 
ATCACCCTTG 
GGCCGACTCG 
GATTGGCGGA 
CGCCTCGCCT 
GCCGAAGCGC 



CAACTCGGAC 
TGGCCGAGGT 
CGCGAAGAGT 
CATGCCCGTC 
CTGAGTACTA 
ACGTCTCGCT 
GATGCCGGCT 
CCTTCTACGA 
CAGGGCTCCG 



TGTGTCATCT 
GCTCATGGAG 
CCATCATGAA 
GCAGCCCGTG 
CCGTGATATG 
GGGCCGAGGC 
GATGGTGGCT 
GCGCGCCGGT 
TCACGATCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
469 



45 2) INFORMATION FOR SEQ ID NO: 962 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 
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TGTCATTAGT 
ATGTCGGCTG 
TTCCCGACCC 
GCGCACCTGC 
AGGCCTCTAT 
GGCAAGCATA 
GCTTCGTGAG 
ACCCCGCCTA 
CTCGTCACCT 
CGGTGCCGTG 



PCT/CA00/01150 



CAGGCCCTCT 
CGGCGAGCGC 
TGACGACCGT 
CTCGTGGCGA 
TTACACCGGC 
TCGCCATGAT 
ATTTCGGGTC 
CCTCAGCGCT 
GCATCGGCGG 
TCTCCACCG 



CCAAGTACTC 
GGTAATGAGA 
GATCGATGGC 
ACACCTCGAA 
ATCACCCTGG 
GGCTGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CAACTCCGAT TGCGTCATCT 
TGGCCGAGGT GCTCATGGAG 
CGCGAGGAGT CGATCATGAA 
CATGCCAGTC GCAGCCCGTG 
CCGAGTACTA CCGTGATATG 
ACGTCTCGCT GGGCCGAGGC 
GATGCCGGCC GATGGTGGCT 
CCTTCTACGA GCGCGCCGGC 
CAGGGCTCCG TCACGATCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
469 
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2) INFORMATION FOR SEQ ID NO:. 963 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGACGACTGT GATCGATGGT CGCGAGGAGT CCATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 200 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 250 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 3 50 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 4 00 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 4 69 



2) INFORMATION FOR SEQ ID NO: 964 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 
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TTCCCGACCC TGACGACCAT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT CTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
ACCCCGCCTA CCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCGCCG 



GATCGATGGT CGGGAGGAGT CGATCATGAA 
ACACCTCGAA CATGCCCGTC GCAGCCCGTG 
ATCACCCTCG CCGAGTACTA CCGTGATATG 
GGCCGACTCG ACGTCTCGCT GGGCTGAGGC 
GTCTGGCGGA GATGCCGGCC GATGGTGGCT 
CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 
GCCGAAGCGC CAGGGCTCCG TCACGATCGT 



150 
200 
250 
300 
350 
400 
450 
469 



2) INFORMATION FOR SEQ ID NO: 965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE : ■ Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 4 50 

CGGTGCCGTG TCTCCGCCG 4 69 



2) INFORMATION FOR SEQ ID NO: 966 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 2 00 
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AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
TAATGGCCGA CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



CATGGGATAT TCAGTAGCGG 
AGGCCCTAAG AGAGATGTCA 
GGTTATCCAG CCTATCTAGG 
AGGTAAGGTA ATATGTAAGG 
TAATAGGTGC CGTGTCACC 



250 
300 
350 
400 
449 
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2) INFORMATION FOR SEQ ID NO: 967 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 9797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 50 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 200 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 250 

GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGCTGGACAC GTACATTCCG 300 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 350 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 4 00 

GCGTGGTGAA GGTTGGCGAG GAAATCGAAA TCGTGGGCAT CAAGCCGACG 450 

GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA 500 

GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 800 

CATGGAAGAA GGTCTGCGTT TCGCCA 826 



2) INFORMATION FOR SEQ ID NO: 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: BD180 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968 
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CGATCCTGGT GGTGTCGGCC GCAGACGGCC 

CACATTTTGC TGTCGCGCCA GGTTGGCGTG 

GAACAAGGCG GACATGGTTG ATGACGCGGA 

TGGAAGTCCG CGAACTGCTG AGCAAGTACG 

5 CCGATCGTGA AGGGTTCGGC CAAGCTGGCG 

ACTGGGCGAG CAGGCGATTC TGTCGCTGGC 

TTCCGACGCC GGAGCGCGCG GTCGACGGTG 

GACGTGTTCT CGATCTCGGG CCGTGGCACG 

GCGCGGCGTG GTGAAGGTTG GCGAGGAAAT 

10 CGACGGTGAA GACGACCTGC ACGGGCGTGG 

GACCAGGGCC AGGCGGGCGA CAACGTGGGT 

GCGTGAAGAC GTCGAGCGTG GCCAGGTGCT 

ACCCGCACAC GGACTTCACG GCCGAGGTGT 

GGTGGCCGTC ACACGCCGTT CTTCAACGGC 

15 CCGCACGACG GACGTGACCG GCACGATCGA, 

TGGTGCTGCC GGGCGACAAC GTGTCGATGA 
ATCGCCATGG AAGAAGG 



PCT/CA00/01150 

CGATGCCGCA GACGCGCGAG 50 

CCGTACATCA TCGTGTTCCT 100 

GCTGCTCGAG CTGGTGGAGA 150 

ATTTCCCGGG CGATGACACG 200 

CTGGAAGGCG ACAAGGGCGA 250 

GCAAGCGCTG GACACGTACA 300 

CGTTCCTGAT GCCGGTGGAA 350 

GTGGTGACTG GCCGTATCGA 400 

CGAAATCGTG GGCATCAAGC 4 50 

AGATGTTCCG CAAGCTGCTG 500 

ATCTTGCTGC GCGGCACCAA 550 

GGCCAAGCCG GGTTCGATCA 600 

ACATTCTGTC CAAGGAAGAG 650 

TATCGTCCGC AGTTCTACTT 700 

CCTGCCGGCG GACAAGGAAA 750 

CCGTCAAGCT GCTGGCCCCG 800 

817 



20 



30 



2) INFORMATION FOR SEQ ID NO: 969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 637 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263- 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

4 0 GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 200 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 250 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 300 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 350 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

45 TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 450 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 500 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAACCAGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 637 

50 



2) INFORMATION FOR SEQ ID NO: 970 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: ATCC 4 9997 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 970 



TATGCCTCAA 
CATACATCGT 
TTACTAGAAT 
CTTCCCAGGC 
TTGAAGGCGA 
GTTGACGAAT 
GATGCCAGTC 
CAGGCCGTGT 
GTTGGTATTG 
GTTCCGTAAA 
TGCTACGTGG 
AAAGCTGGTA 
TTTAACAAAA 



ACACGTGAAC 
TGTTTTCTTA 
TAGTTGAAAT 
GACGATGTTC 
TGCTTCATAC 
ACGTTCCAAC 
GAAGACGTAT 
TGAACGTGGA 
CTGAAGAAAC 
TTGTTAGACT 
GGTTGCTCGT 
CAATCACACC 
GAAGAAGGTG 



ACATCTTGTT 
AACAAAATGG 
GGAAGTTCGT 
CTGTAATCGC 
GAAGAAAAAA 
TCCAGAACGT 
TCTCAATCAC 
CAAGTTCGCG 
TGCTAAAACA 
ATGCTGAAGC 
GAAGACATCC 
TCATACAAAA 
GACGTCACAC 



ATCACGTAAC GTTGGTGTAC 
ATATGGTTGA TGACGAAGAA 
GACTTATTGT CAGAATATGA 
TGGTTCTGCT TTGAAAGCTC 
TCATGGAATT AATGGCTGCA 
GACACTGACA AACCATTCAT 
TGGACGTGGT ACTGTTGCTA 
TTGGTGACGA AGTTGAAATC 
ACTGTAACTG GTGTTGAAAT 
AGGGGATAAC ATTGGTGCAT 
AACGTGGACA AGTATTAGCT 
TTTAAAGCTG AAGTTTACGT 
TCCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
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550 
600 
634 



2) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971 

GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA 300 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 500 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 550 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 600 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA CGTGACAATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 787 



2) INFORMATION FOR SEQ ID NO: 972 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 

TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 
CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 
CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 
GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 
CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 
AATCCTGGAA CTGGCTGGCT TCCTGGATTC TTACATTCCG 
GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 
TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 
AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 
CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 
GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 
ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 
TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG 
CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 
GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA 
ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 
GGT 



GCACATCCTG 50 

TGAACAAATG 100 

ATGGAAGTTC 150 

TCCGATCGTT 200 

GGGAAGCGAA 250 

GAACCAGAGC 300 

ATTCTCCATC 350 

GTATCATCAA 4 00 

CAGAAGTCTA 4 50 

AGGCCGTGCT 500 

AAGAAATCGA 550 

CACACCAAGT 600 

CCGTCATACT 650 

CTACTGACGT 700 

ATGCCGGGCG 750 

GATGGACGAC 800 
803 



35 



2) INFORMATION FOR SEQ ID NO: 973 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 



ACTGACGGCC 
GGTAGGCGTT 
ATGACGAAGA 

55 TCTCAGTACG 
TCTGAAAGCG 
TGGCTGGCTT 
AAGCCGTTCC 
TACCGTTGTT 

60 AAGTTGAAAT 



CGATGCCGCA 
CCGTACATCA 
GCTGCTGGAA 
ACTTCCCGGG 
CTGGAAGGCG 
CCTGGATTCC 
TGCTGCCGAT 
ACCGGTCGTG 
CGTTGGTATC 



GACTCGTGAG 
TCGTGTTCCT 
CTGGTTGAAA 
CGACGACACT 
ACGCAGAGTG 
TACATTCCGG 
CGAAGACGTA 
TAGAACGCGG 
AAAGAGACTC 



CACATCCTGC 
GAACAAATGC 
TGGAAGTTCG 
CCGATCGTTC 
GGAAGCGAAA 
AACCAGAGCG 
TTCTCCATCT 
TATCATCAAA 
AGAAGTCTAC 



TGGGTCGTCA 50 

GACATGGTTG 100 

TGAACTTCTG 150 

GTGGTTCTGC 200 

ATCCTGGAAC 250 

TGCRATTGAC 300 

CCGGTCGTGG 3 50 

GTTGGTGAAG 4 00 

CTGTACTGGC 4 50 



541 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGAAATGT TCCGCAAACT GCTGGACGAA 

AGGTGTTCTG CTGCGTGGTA TCAAACGTGA 

TACTGGCTAA GCCGGGCACC ATCAAGCCGC 

GTGTACATTC TGTCCAAAGA TGAAGGCGGC 

5 AGGCTACCGT CCGCAGTTCT ACTTCCGTAC 

TTGAACTGCC GGAAGGCGTA GAGATGGTAA 
ATGGTTGTTA CC 



PCT/CA00/01150 

GGCCGTGCTG GTGAGAACGT 500 

AGAAATCGAA CGTGGTCAGG 550 

ACACCAAGTT CGAATCTGAA 600 

CGTCATACTC CGTTCTTCAA 650 

TACTGACGTG ACTGGTACCA 700 

TGCCGGGCGA CAACATCAAA 750 

762 



10 



2) INFORMATION FOR SEQ ID NO: 974 



15 



20 



25 



30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 



CCTGGTAGTT 
TCCTGCTGGG 
AAATGCGACA 
AGTTCGTGAA 
TCGTTCGTGG 
GCGAAAATCC 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTTG 
GTCTACCTGT 
GTGCTGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGA 



GCTGCGACTG 
TCGTCAGGTA 
TGGTTGATGA 
CTTCTGTCTC 
TTCTGCTCTG 
TGGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GTACCATCGA 
ATCAAAATGG 



ACGGCCCGAT 
GGCGTTCCGT 
CGAAGAGCTG 
AGTACGACTT 
AAAGCGCTGG 
TGGCTTCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GCCGCAGACT 
ACATCATCGT 
CTGGAACTGG 
CCCGGGCGAC 
AAGGCGACGC 
GATTCCTACA 
GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GCGGTATCAA 
GGCACCATCA 
CAAAGATGAA 
AGTTCTACTT 
GGCGTAGAGA 
GATCCACCCG 



CGTGAGCACA 50 

GTTCCTGAAC 100 

TTGAAATGGA 150 

GACACTCCGA 2 00 

AGAGTGGGAA 250 

TTCCGGAACC 300 

GACGTATTCT 350 

ACGCGGTATC 4 00 

AGACTCAGAA 450 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 600 

GGCGGTCGTC 650 

CCGTACTACT 700 

TGGTAATGCC 750 

ATCGCGATGG 800 
804 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

542 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975 

GCGATCCTGG TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

5 TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 200 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 2 50 

GGGAAGCGAA AATCCTGGAA CTGGCTGGCT TCCTGGATTC YTACATTCCG 3 00 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 350 

10 ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 4 00 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 450 

CAGAAGTCTA CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 600 

15 CACACCAAGT TCGAATCTGA AGTGTACATT , CTGTCCAAAG ATGAAGGCGG 650 

TCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGTGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 800 

GATG 804 

20 



2) INFORMATION FOR SEQ ID NO: 976 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 
35 (B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CGCAGACCCG 50 

4 0 TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 200 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 250 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 300 

45 GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 350 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 400 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA ACGTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

50 GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 600 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACGC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

55 CGCCATGGAC GACGGTSTGC GGTTC 825 



60 



2) INFORMATION FOR SEQ ID NO: 977 

543 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneu/noniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 



PCT/CA00/01150 



15 TATCCTTGTA 
ACATCCTTCT 
AACAAAGTTG 
GGAAATCCGT 
CAGTTATCCA 

20 GAAGACATCG 
ACCAGAACGT 
TCTCAATCAC 
ATCGTTAAAG 
TCAAAAAGCA 

25 AAGGTCTTGC 
GATGAAATCG 
ACACACTAAA 
GACGTCACAC 
ACTACTGACG 

30 AATGCCTGGT 
CCGTAGAACA 



GTAGCTTCAA 
TTCACGTCAG 
ACTTGGTTGA 
GACCTATTGT 
AGGTTCAGCA 
TTATGGAATT 
GACACTGACA 
TGGACGTGGT 
TCAACGACGA 
GTTGTTACTG 
TGGAGATAAC 
AACGTGGACA 
TTCAAAGGTG 
TCCATTCTTC 
TTACAGGTTC 
GATAACGTGA 
AGGTACTACA 



CTGACGGACC 
GTTGGTGTTA 
CGACGAAGAA 
CAGAATACGA 
CTTAAAGCTC 
GATGAACACA 
AACCATTGCT 
ACAGTTGCTT 
AATCGAAATT 
GTGTTGAAAT 
GTAGGTGTCC 
AGTTATCGCT 
AAGTCTACAT 
AACAACTACC 
AATCGAACTT 
CAATCGACGT 



AATGCCACAA 
AACACCTTAT 
TTGCTTGAAT 
CTTCCCAGGT 
TTGAAGGTGA 
GTTGATGAGT 
TCTTCCAGTC 
CAGGACGTAT 
GTTGGTATCA 
GTTCCGTAAA 
TTCTTCGTGG 
AAACCAGGTT 
CCTTACTAAA 
GTCCACAATT 
CCAGCAGGTA 
TGAGTTGATT 



ACTCGTGAGC 50 

CGTCTTCATG 100 

TGGTTGAAAT 150 

GACGATCTTC 200 

CTCTAAATAC 250 

ATATCCCAGA 300 

GAGGACGTAT 350 

CGACCGTGGT 4 00 

AAGAAGAAAC 450 

CAACTTGACG 500 

TGTTCAACGT 550 

CAATCAACCC 600 

GAAGAAGGTG 650 

CTACTTCCGT 700 

CTGAAATGGT 750 

CACCCAATCG 800 
820 



35 2) INFORMATION FOR SEQ ID NO: 978 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 



(xi) SEQUENCE DESCRIPTION: SEQ 



50 



55 



60 



GGCGCGATCC TGGTGGTCGC 
TGAGCACGTG CTGCTCGCCC 
CGCTGAACAA GTCCGACGCG 
GAGCTGGAGG TCCGCGAGTT 
TCCGGTGGTC CGGGTCTCGG 
GGGTGAAGTC GGTAGAGGAC 
GACCCGGTCC GCGACACCGA 
CTTCACCATC ACCGGTCGTG 
GCGTGGTGAA CGTGAACGAG 
AGCACCAAGA CCACGGTCAC 



CGCCACCGAT 
GTCAGGTGGG 
GTCGACGACG 
GCTGGCCGCC 
CGCTGAAGGC 
TTGATGGACG 
CAAGCCGTTC 
GCACCGTCGT 
GAAGTCGAGA 
CGGTGTGGAG 



ID NO: 978 






GGCCCGATGC 


CGCAGACCCG 


50 


CGTGCCCTAC 


ATCCTGGTGG 


100 


AGGAGCTGCT 


CGAGCTCGTC 


150 


CAGGACTTCG 


ACGAGGAAGC 


200 


GCTCGAGGGC 


GACGCCACCT 


250 


CGGTCGACGA 


GTCGATTCCG 


300 


CTGATGCCCG 


TCGAGGACGT 


350 


CACCGGCCGT 


GTGGAGCGCG 


400 


TCGTCGGCAT 


CAAGCCGACC 


450 


ATGTTCCGCA 


AGCTGCTCGA 


500 



544 
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CCAGGGTCAG GCCGGTGACA 
GTGAGGACGT CGAGCGCGGC 
CCGCACACCG AGTTCGAGGG 
CGGCCGGCAC ACGCCGTTCT 
GCACCACCGA CGTGACCGGT 
GTGATGCCCG GTGACAACAC 
CGCCATGGAC GACGGTCTGC 



ACGTCGGTCT 
CAGGTCGTCA 
TCAGGTCTAC 
TCAACAACTA 
GTGGTGACGC 
CAACATCTCG 
GG 



GCTGCTGCGT GGTGTCAAGC 
TCAAGCCCGG CACCACCACT 
ATCCTGTCCA AGGACGAGGG 
CCGTCCGCAG TTCTACTTCC 
TGCCGGAGGG CACCGAAATG 
GTGAAGCTGA TCCAGCCCGT 



550 
600 
650 
700 
750 
800 
822 



2) INFORMATION FOR SEQ ID NO: 979 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC ACTTAAAGCT CTTGAAGGTG ACTCTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TATATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAGGACGTA 3 50 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 4 50 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CTGGAGATAA CGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTCTACA TCCTTACTAA AGAAGAAGGT 65 0 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTAC A 821 



2) INFORMATION FOR SEQ ID NO: 980 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 



(B) STRAIN: ATCC 25177 



545 
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(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 980 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 



TGGTGGTCGC 
CTGCTGGCGC 
GGCCGACGCA 
TCCGCGAGCT 
CGGGTCTCGG 
TGTCGAGGAA 
GCGAGACCGA 
ACCGGCCGCG 
CGTGAACGAG 
CCACCGTCAC 
GCGGGCGACA 
CGAGCGTGGC 
AGTTCGAAGG 
ACGCCGTTCT 
CGTGACCGGT 
GTGACAACAC 
GAAGGTNTGC 



CGCCACCGAC 

GTCAAGTGGG 

GTGGACGACG 

GCTGGCTGCC 

CGCTCAAGGC 

CTGATGAACG 

CAAGCCGTTC 

GAACCGTGGT 

GAAGTTGAGA 

CGGTGTGGAG 

ACGTTGGTTT 

CAGGTTGTCA 

CCAGGTCTAC, 

TCAACAACTA 

GTGGTGACAC 

CAACATCTCG 

GTTTCGCG 



GGCCCGATGC CCCAGACCCG 
TGTGCCCTAC ATCCTGGTAG 
AGGAGCTGCT CGAACTCGTC 
CAGGAATTCG ACGAGGACGC 
GCTCGAGGGT GACGCGAAGT 
CGGTCGACGA GTCGATTCCG 
CTGATGCCGG TCGAGGACGT 
CACCGGACGT GTGGAGCGCG 
TCGTCGGCAT TCGCCCATCG 
ATGTTCCGCA AGCTGCTCGA 
GCTGCTGCGG GGCGTCAAGC 
CCAAGCCCGG CACCACCACG 
ATCCTGTCCA AGGACGAGGG 
CCGTCCGCAG TTCTACTTCC 
TGCCGGAGGG CACCGAGATG 
GTGAAGTTGA TCCAGCCCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
828 



2) INFORMATION FOR SEQ ID NO: 981 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 

CGGCGGTATC TTAGTAGTAT CTGCTGCAGA TGGCCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGCTGACAT GGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATACGACTTC CCTGGTGACG 200 

ATGTACCAGT TATCGTTGGT TCTGCATTAA AAGCTTTAGA AGGCGACCCA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCTGTAG ATGACTACAT 300 

CCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCAACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATCACTGA 450 

AGAAAGCAAG AAAACAACAG TTACAGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT 550 

GCACGTGAAG ACGTACAACG TGGACAAGTA TTAGCAGCTC CTGGCTCTAT 600 

TACTCCACAC ACAAAATTCA AAGCTGATGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGCCC ACAATTCTAC 700 

TTCCGTACTA CTGACGTAAC TGGCGTTGTT CACTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGATA ACGTAGAAAT GACTGTTGAA TTAATCGCTC 800 

CAATCGCGAT TGAAGACGG 819 



2) INFORMATION FOR SEQ ID NO: 982 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC 
CACATCCTTC TTTCACGTCA GGTTGGTGTT 
GAACAAGATC GACTTGGTTG ATGACGAAGA 
TGGAAATCCG TGACCTCTTG, TCAGAATACG 
CCAGTTATCC AAGGTTCAGC TCTTAAAGCT 
CGAAGACATC ATCATGGAAT TGATGAACAC 
AACCAGAACG TGATACTGAC AAACCTCTTC 
TTCTCAATCA CTGGTCGTGG TACAGTTGCT 
TACTGTTCGT GTCAACGATG AAATCGAAAT 
TCCAAAAAGC AGTTGTTACT GGTGTTGAAA 
GAAGGTCTTG CAGGGGACAA CGTAGGTGTA 
TGATGAAATC GAACGTGGTC AAGTTATCGC 
CACACACTAA ATTCAAGGGT GAAGTTTACA 
GGACGTCACA CTCCATTCTT CAACAACTAC 
TACAACTGAC GTTACAGGTT CAATCGAACT 
TAATGCCTGG TGATAACGTA ACTATCGACG 
GCCGTTGAAC AAGG 



2) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 94 56 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983 

CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

547 



ID NO: 982 

CAATGCCACA AACTCGTGAG 50 

AAACACCTTA TCGTCTTCAT 100 

ATTGCTTGAA TTGGTTGAAA 150 

ACTTCCCAGG TGACGATCTT 200 

CTTGAAGGTG ATACTAAGTA 250 

TGTTGATGAG TACATCCCAG 300 

TTCTTCCAGT CGAAGACGTA 350 

TCAGGACGTA TCGACCGTGG 4 00 

CGTTGGTATC AAAGAAGAAA 450 

TGTTCCGTAA ACAGCTTGAC 500 

CTTCTTCGTG GTATCCAACG 550 

TAAACCAGGT TCAATCAACC 600 

TCCTTACTAA AGAAGAAGGT 650 

CGTCCACAGT TCTACTTCCG 700 

TCCAGCAGGT ACTGAAATGG 750 

TTGAGTTGAT CCACCCAATC 800 

814 
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GAAGGTCTTG CCGGAGATAA TGTAGGTGTC 
TGATGAAATC GAACGTGGAC AAGTTATTGC 
CACACACTAA ATTCAAAGGT GAAGTTTACA 
GGACGTCACA CTCCATTCTT CAACAACTAC 
TACTACTGAC GTTACAGGTT CAATCGAACT 
TAATGCCTGG TGATAACGTG ACAATCGACG 
GCCGTAGAAC 



PCT/CAOO/01150 

CTTCTTCGTG GTGTTCAACG 550 

TAAACCAGGT TCAATCAACC 6 00 

TCCTTACTAA AGAAGAAGGT 650 

CGTCCACAAT TCTACTTCCG 700 

TCCAGCAGGT ACTGAAATGG 750 

TTGAGTTGAT CCACCCAATC 800 

810 



2) INFORMATION FOR SEQ ID NO: 984 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 100 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

AAATCCGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGCTCTT GAAGGTGATA CTAAGTACGA 2 50 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 300 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AGACGTATTC 350 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG ACCGTGGTAC 4 00 

TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 450 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TCCGTAAACA GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGTACTT CTTCGTGGTA TCCAACGTGA 550 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA ACCAGGTTCA ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT ACTTCCGTAC 700 

AACTGACGTT ACAGGTTCAA TCGAACTTCC AGCAGGTACT GAAATGGTAA 750 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 800 

GTTGAACAAG GTACTAC 817 



2) INFORMATION FOR SEQ ID NO: 985 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



548 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985 



PCT/CAOO/01150 



GACGGACCAA TGCCACAAAC TCGTGAGCAC ATCCTTCTTT CACGTCAGGT 50 

TGGTGTTAAA CACCTTATTG TCTTCATGAA CAAAATTGAC TTGGTAGACG 100 

5 ACGAAGAATT GCTTGAATTG GTTGAAATGG AAATCCGTGA CCTCTTGTCA 150 

GAATACGACT TCCCAGGTGA CGATCTTCCA GTTATCCAAG GTTCAGCTCT 200 

TAAAGCTCTT GAAGGTGACT CTAAATACGA AGACATCATT ATGGAATTGA 250 

TGAACACTGT TGATGAGTAC ATCCCAGAAC CAGAACGTGA CACTGAAAAA 300 

CCATTGCTTC TTCCAGTCGA AGACGTATTC TCAATCACTG GACGTGGTAC 350 

10 AGTTGCTTCA GGACGTATCG ACCGTGGTAC TGTTCGTGTC AACGACGAAA 4 00 

TCGAAATCGT TGGTATCAAA GAAGAAACTC AAAAAGCAGT TGTTACTGGT 450 

GTTGAAATGT TCCGTAAACA ACTTGACGAA GGTCTTGCCG GAGATAACGT 500 

AGGTGTCCTT CTTCGTGGTG TTCAACGTGA CGAAATCGAA CGTGGACAAG 550 

TTATCGCTAA ACCAGGTTCA ATCAACCCAC ACACTAAATT TAAAGGTGAA 600 

15 GTCTACATCC TTACTAAAGA AGAAGGTGGA CGTCACACTC CATTCTTCAA 650 

CAACTACCGT CCACAATTCT ACTTCCGTAC TACTGACGTT ACAGGTTCAA 700 

TCGAACTTCC TGCAGGTACT GAAATGGTAA TGCCTGGTGA TAACGTGACT 750 

ATCGACGTTG AGTTGATCCA CCCAATCGCC GTAGAACAAG GTACTACA 798 



20 



35 



2) INFORMATION FOR SEQ ID NO: 986 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH : 815 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986 



TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

40 AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT 200 

ATCCAAGGTT CAGCACTTAA AGCTCTTGAA GGTGACTCTA AATACGAAGA 250 

CATCGTTATG GAATTGATGA ACACAGTTGA TGAGTATATT CCAGAACCAG 3 00 

AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA CGTATTCTCA 3 50 

4 5 ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 4 00 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA 4 50 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCTGGAG ATAACGTAGG TGTCCTTCTT CGTGGTGTTC AACGTGATGA 550 

AATCGAACGT GGACAAGTTA TCGCTAAACC AGGTTCAATC AACCCACACA 600 

50 CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA AGGTGGACGT 650 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 700 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA 800 

GAACAAGGTA CTACA 815 

55 

2) INFORMATION FOR SEQ ID NO: 987 

60 (i) SEQUENCE CHARACTERISTICS: 

549 
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(B) 
(C) 
(D) 
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LENGTH: 832 bases 
TYPE: Nucleic acid 
STRANDEDNESS : Double 
TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 804 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987 



CGGACAATCT 

15 TGAACACATT 
TCTTGAACAA 
GAAATGGAAG 
TACACCTGTT 
ATGCAGAAGC 

20 CCAACACCAG 
TGTATTTTCA 
GTGGGGCTGT 
GAAACGCAAA 
TGATTACGGT 

25 AAAGAGAAGA 
ACACCTCATA 
AGGCGGACGT 
TCCGCACAAC 
ATGGTCATGC 

30 TGTTGCTATT 



TGGTTGTTTC 
TTACTTTCTC 
AACAGATTTA 
TTCGTGAACT 
ATCAAAGGAT 
AGCTATCATG 
AACGTGATAC 
ATTACTGGAC 
TAGAGTCGGT 
GAGCCGTAGT 
GAAGCAGGGG 
CATCGAACGT 
CTAAATTCAA 
CATACACCAT 
TGACGTAACA 
CTGGCGACAA 
GAAAACGGGA 



TGCTACGGAT 
GCCAAGTAGG, 
GTCGATGATG 
ATTAAGCGAA 
CAGCATTAAA 
GAATTGATGG 
GGACAAACCA 
GTGGAACAGT 
GACGAAATCG 
AACAGGAGTT 
ATAACGTAGG 
GGCCAAGTGA 
AGCAGAAGTT 
TCTTCAATAA 
GGAACAATTG 
CGTAACGATC 
CGACTTTCTC 



GGCCCAATGC 
CGTGAAATAT 
AAGAATTAAT 
TATGGATTTC 
AGCATTACAA 
ACACAGTCGA 
TTATTGTTAC 
AGCATCTGGT 
AAATCGTAGG 
GAAATGTTCC 
TGTGTTATTA 
TTGCCAAACC 
TAGGTTTTGA 
TTATCGACCA 
TTTTGCCAGA 
GATGTAGAAT 
CA 



CTCAAACTCG 50 

TTGATTGTTT 100 

TGATCTAGTA 150 

CAGGTGATGA 200 

GGTGATCCTG 250 

TGAATACATC 300 

CCGCCGAAGA 350 

CGGATTGACC 4 00 

GATCAAACCA 450 

GCAAAACGCT 500 

CGTGGGATTC 550 

TGGTTCAATT 600 

CTAAAGAAGA 650 

CAATTTTATT 700 

AGGAACGGAA 750 

TGATTCATCC 800 
832 



2) INFORMATION FOR SEQ ID NO: 988 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 50 

GCGAGCATAT CCTTTTATCA CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA ATATGGTTTC CCAGGTGATG 200 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 250 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 300 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 350 

ATGTTTTCTC AATCACTGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 400 

CGTGGAGCCG TTCGTGTCGG TGATGAAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 

550 
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CAAAGAGAAG ATATTGAACG 
CACACCACAT ACAAAATTCA 
AAGGCGGACG TCATACACCA 
TTCCGCACAA CAGATGTAAC 
AATGGTCATG CCTGGAGACA 
CAGTGGCAAT CGAACAAGGA 



TGGACAAGTA 
AAGCGGAAGT 
TTCTTCAATA 
AGGTACGATC 
ACGTAACCAT 
ACGACTTTCT 



ATTGCGAAAC CTGGTTCAAT 
TTATGTATTG ACGAAAGAAG 
ACTACCGCCC ACAATTTTAT 
GTGTTGCCAG AAGGAACAGA 
CGAGGTAGAG TTGATCCATC 
CTATT 



600 
650 
700 
750 
800 
835 
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2) INFORMATION FOR SEQ ID NO: 989 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 2 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:, Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989 

TGCGATCTTA GTAGTATCTA CTACTGATGG ACCAATGCCT CAAACACGTG 50 

AGCACATTTT GTTATCACGC CAAGTAGGTG TTAAATACTT GATCGTCTTC 100 

TTGAACAAAG TTGATTTAGT CGACGATGAA GAATTGATTG ATTTAGTTGA 150 

AATGGAAGTA CGTGAGTTAC TTTCAGAATA TGGTTTCCCA GGCGATGATA 200 

TTCCTGTTCT TAAAGGTTCA GCTCTGAAAG CTTTAGAAGG CGATCCTGAA 250 

CAAGAACAAG TAATCATGGA CCTAATGGAT ACGGTTGACG AATATATCCC 300 

AACACCAGAA CGTGACACTG ACAAACCATT CTTGTTACCA GTGGAAGATG 350 

TTTTCTCAAT CACAGGACGT GGGACTGTTG CATCTGGTCG TATTGATCGT 400 

GGGGAAGTTA AAGTCGGTGA CGAAGTTGAA ATTATCGGGA TCAAACCTGA 450 

AGTTCAAAAG GCTGTCGTAA CTGGACTTGA AATGTTCCGT AAAACATTGG 500 

ATTATGGTGA AGCTGGAGAT AACGTTGGGG TTCTATTACG TGGTATTACA 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCTAAACCAG GTTCAATTAC 600 

ACCACATACG AAATTCAGTG CAGAAGTTTA TGTGTTGACG AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC TTTAACAACT ATCGTCCTCA ATTCTACTTC 700 

CGTACAACAG ACGTTACCGG TAATATCGTG TTGCCAGAAG GTACTGAAAT 750 

GGTCATGCCT GGCGATAACG TAACAATCGA CGTTGAATTA ATCCATCCAA 800 

TCGCCGTAGA AAAAGGAACA ACTTTCTCTA TT 832 



2) INFORMATION FOR SEQ ID NO: 990 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990 
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GGATCCTGTA TATGCACAAA AACTAGGTGT 
TATCACAACC TGATACAGGG GAGCAAGGTT 
GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT GAATTACTAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 



2) INFORMATION FOR SEQ ID NO: 991 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991 

25 GCCATTGCAG AGGCACAGAA GCAGGGCGGT ATTGCAGCCT TCATTGATGC 50 

TGAGCACGCC TTCGACCGTT TCTATGCAGA GAAGTTAGGT GTGGATGTTG 100 

ATAACCTTTG GGTTTCACAG CCAGACAATG GTGAGCAGGC TTTAGAGATT 150 

GCCGACCAGC TGATTCGCTC TTCCGCTATT GACATTCTCG TTGTCGACTC 200 

AGTTGCAGCC TTGACTCCAA AGAAGGAGAT TGAGGGTGAC ATGGGTGACT 250 

30 CTGCAGTAGG TTTACAAGCA CGACTGATGA GTCAGGCATT GCGTAAACTT 3 00 

ACCTCAACAA TCGCAAAAAC TAATACTTGC TGCATCTTCA TCAACCAGTT 35 0 

GCGTGAGAAG ATTGGTGTGA TGTTTGGTAA TCCA 384 



35 



45 



2) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA TTACTAGAAT 100 

TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 200 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 250 

ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT GATGCCAGTC 300 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 3 50 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 400 

60 CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT GTTCCGTAAA 450 

552 
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TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT TGCTACGTGG 
TGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTGGCT AAAGCTGGTA 
CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT TTTAACAAAA 
GAAGAAGGTG GACGTCATAC ACCA 



500 
550 
600 
624 
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2) INFORMATION FOR SEQ ID NO: 993 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 100 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 2 50 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 3 00 

TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 350 

TTATTTATGT GAAAATGCAG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 4 00 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTCGTTAAA 450 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAATATCTTA 550 

CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 650 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 7 00 

ATGAAACGCT TGACTCAAAC ACAAGCCAAA TTGAAGTCTA CCTAACAACC 750 

AAGTAA 756 



2) INFORMATION FOR SEQ ID NO: 994 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
TGGACTAACA ATCTCGCAAG AGG 23 



2) INFORMATION FOR SEQ ID NO: 995 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 
ACATTCTCGT GAGTAACAGG GT 



2) INFORMATION FOR SEQ ID NO: 996, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 
ACAAATCATG AAGGGAATCA TTTAG 



2) INFORMATION FOR SEQ ID NO: 997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 



2) INFORMATION FOR SEQ ID NO: 998 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 
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2) INFORMATION FOR SEQ ID NO: 999 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG - 23 



2) INFORMATION FOR SEQ ID NO: 1000 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 0 

30 

TTAGTGTGTG GGTTGATTGA ACT 23 



35 2) INFORMATION FOR SEQ ID NO: 1001 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001 

AAGAGTTGCT TGAATTAGTT GAG 23 



50 

2) INFORMATION FOR SEQ ID NO: 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : * Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

555 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: ATCC 7002 94 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002 



AACATGATCA 
TTCAACTGAT 
GTCAGGTTGG 
GTTGATGACG 
TCTTTCAGAA 
CAGCTCTTAA 
GAATTGATGG 
TGACAAACCA 
GTGGTACAGT 
GACGAAATCG 
TACTGGTGTT 
ACAACGTAGG 
GGTCAAGTTA 
AGGTGAAGTA 
TCTTCAACAA 
GGTTCAATCG 
CGTGACAATC 
CTACTTTCTC 



CTGGTGCCGC 
GGACCAATGC 
TGTTAAACAC 
AAGAGTTGCT 
TACGATTTCC 
AGCTCTTGAA 
ATACTGTTGA 
TTGCTTCTTC 
TGCTTCAGGA 
AAATCGTTGG 
GAAATGTTCC 
TATCCTTCTT 
TTGCTAAACC 
TATATCCTTT 
CTACCGTCCA 
AACTTCCAGC 
AACGTTGAGT 
AATCCGTGAA 



TCAAATGGAC GGAGCTATCC 
CACAAACTCG TGAGCACATC 
CTTATCGTGT TCATGAACAA 
TGAATTAGTT GAGATGGAAA 
CAGGTGATGA CCTTCCAGTT 
GGCGACACTA AATTTGAAGA 
TTCATACATT CCAGAACCAG 
CAGTCGAAGA CGTATTCTCA 
CGTATCGACC, GTGGTACTGT 
TATCAAAGAA GAAACTAAAA 
GTAAACAACT TGACGAAGGT 
CGTGGTGTTC AACGTGACGA 
AAGTTCAATC AACCCACACA 
CTAAAGACGA AGGTGGACGT 
CAATTCTACT TCCGTACAAC 
AGGTACAGAA ATGGTTATGC 
TGATCCACCC AATCGCCGTA 
GGTGGACGTA CTGTTGGTTC 



TTGTAGTTGC 
CTTCTTTCAC 
AGTTGACCTT 
TTCGTGACCT 
ATCCAAGGTT 
CATCATCATG 
AACGCGACAC 
ATTACAGGTC 
TCGTGTCAAC 
AAGCTGTTGT 
CTTGCAGGAG 
AATCGAACGT 
CTAAATTCAA 
CACACTCCAT 
TGACGTAACA 
CTGGTGATAA 
GAACAAGGTA 
AGGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
894 



2) INFORMATION FOR SEQ ID NO: 1003 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 10 0 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CAAAAACAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 3 00 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 332 



2) INFORMATION FOR SEQ ID NO: 1004 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 



10 ACCAAGAAGC 
GTCTCTTACC 
TTCAAATGGT 
ACGTTTCATT 
TCTGCTATGA 

15 TTATGATTCC 
GAACAAGCAC 
ACTCTGCAAT 
TTTGAATAAG 
GTATCGACTA 

20 ACAGAATCTA 
TTATGCTGCC 
ATAAAGTCGT 
ACTCGTGCCA 
AACAGTCTTG 

25 CTCAAGCTGG 
AACCACATCA 
TTATACTCGT 
TAACTCCTAT 
TCAATGATAA 

30 GCCAGACGGA 
GCCCAATATG 
TCAAGCTCAT 
AAGCACAAAT 
CAAATACAAC 

35 



TCAAAAACAT 
CTGACGATGA 
AAAGTCATCG 
TGGTACCAAC 
AACCAATCAC 
ACTGCAACTA 
ACCTGTCTAC 
ATGCTCTTCA 
GTCGGTCTAG 
TCCAAGCATG 
ATAAACAATA 
TTTGCAAATG 
CTTCAGTGAT 
TGAAGGAAAC 
ACTTATGGAA 
TAAAACAGGA 
AGAACACTGG 
AAGTATTCTA 
CGTTGGAGAT 
CGTATCTATC 
CTTTTCAGAA 
GACTGAACCC 
CATCAGATAG 
AATAGTACGA 
CC 



CTGTGGGATA 
TTTGCAAGTC 
CCCAACTTGG 
CAAGCTGTGG 
CGATTATGCA 
TGGTTAATGA 
AACTGGGATA 
ACAATCACGA 
ATAGAGCTAA 
CATTATGCAA 
CGGAGCAAGT 
GTGGCACTTA 
GGAAGTAAAA 
GACAGCCTAT 
CTGGGCGTGG 
ACCTCTAACT 
CTATGTAGCT 
TGGCTGTATG 
GGTTTCCTAG 
AGAAGATACT 
ACGGGGAATT 
TCTACTCAAC 
TTCAACTTCA 
CTACCAATCC 



TCTACAACTC 
GCATCTACGG 
AGCTCGTCAC 
AAACCAATCG 
CCTGCCATAG 
TATTCCTTAT 
GAGCATATTT 
AATGTCACAG 
AACCTTCCTT 
ACGCCATTTC 
AGTGAAAAAA 
CTATAAACCA 
AAGAGTTCTC 
ATGATGACCG 
AGCCTATCTT 
ATACAGATGA 
CCAGATGAAA 
GACAGGTTAT 
TTGCAGCTAA 
CATCCAGAAG 
TGTATTCAAA 
AATCCTCAAC 
CAGTCTAGCT 
TAACAATAAT 



CGATCAATAC 50 

TCGTAGATGT 100 

CAAGCAAGTA 150 

TGACTGGGGT 200 

AATACGGTGT 250 

AACTATCCGG 300 

CGGTAATATT 350 

CCGTTGAGAC 4 00 

AATGGTCTTG 450 

AAGTAATACA 500 

TGGCTGCTGC 550 

ATGTATATCC 600 

TAATGTCGGA 650 

ACATGATGAA 700 

CCTTGGCTTC 750 

GGAAGTTGAA 800 

TGTTTGTTGG 850 

TCGAATCGTT 900 

AGTTTATCGC 950 

ACTGGACGAT 1000 

AATGGAGCTC 1050 

AGCTGAAAGT 1100 

CAACCACTCC 1150 
ACGCAACAAT ' 1200 
1212 



2) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 
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CAATATGCTC 
TAAGGTCGGT 
ACTATCCAAG 
TCTAATAAAC 
5 TGCCTTTGCC 
TCGTCTTCAG 
GCTATGAAAG 
CTTGGCATAC 
CTGGTAAGAC 

10 ATCAAGAACA 
TCGTAAGTAT 
CTATCGTTGG 
ATAACGTATC 
GGGGCTCTAC 

15 CGTGGAGCTC 
TCATCATCAG 
AAATAATAGT 
CAACCCCTGA 



TTCAACAATC 
CTAGATAGAG 
CATGCATTAT 
AATACGGAGC 
AACGGTGGAA 
TGACGGTAGT 
AAACAACTGC 
GGAACTGGTC 
AGGTACTTCT 
CTGGCTATGT 
TCTATGGCTG 
AGATGGTTTC 
TGTCTGAAGG 
AGAAATGGAG 
ACCTGCTCCA 
ATAGTTCAAC 
ACGACTACCA 
TC 



ACGAAATGTC 

CTAAAACCTT 

GCAAACGCCA 

AAGTAGTGAA 

TTTACCACAA 

AAAAAAGAAT 

TTACATGATG 

GTGGAGCCTA 

AACTACACAG 

AGCTCCAGAT 

TATGGACAGG 

CTAGTTGCAG 

AAGCAATCCA 

AATTCGTATT 

CAACAACCCC, 

TTCACAGTCT 

ATCCTAACAA 



ACAGCCGTTG 
CCTTAATGGT 
TTTCAAGTAA 
AAAATGGCTG 
ACCCATGTAT 
TTTCAGATGT 
ACCGAAATGA 
TCTCCCATGG 
ATGATGAAAT 
GAAATGTTTG 
TTATTCGAAT 
CTAAAGTTTA 
GAGGATTGGA 
TAAAAATGGT 
CATCAACTGA 
AACTCAACCA 
TAATACGCAA 



PCT/CAOO/01150 

AGACTTTGAA 4 00 

CTTGGTATCG 450 

TACAACAGAA 500 

CTGCTTATGC 550 

ATCAATAAGG 600 

AGGTACACGA 650 

TGAAAACTGT 700 

TTAGCGCAAG 750 

TGAAAAACAC 800 

TTGGTTATAC 850 

CGTTTAACTC 900 

TCGCTCAATG 950 

ATATACCAGA 1000 

GCTCGTTCTA 1050 

AAGTTCAAGC 1100 

CTCCAAGCAC 1150 

CAATCAAATA 1200 
1212 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 



ACCAAGAAGC 
GTCTCTTACC 

40 TTCAAATGGT 
ACGTTTCATT 
TCTGCTATGA 
TTATGATTCC 
GAACAAGCAC 

45 ACTCTGCAAT 
TTTGAATAAG 
GTATCGACTA 
ACAGAATCTA 
TTATGCTGCC 

50 ATAAGGTCGT 
ACACGAGCTA 
AACTGTCTTG 
CGCAAGCTGG 
AAACACATCA 

55 TTATACTCGT 
TAACTCCTAT 
TCAATGATAA 
ACCAGAGGGG 
GTTCTACGTG 

60 TCAAGCTCAT 



TCAAAAACAT 
CTGACGATGA 
AAAGTCATCG 
TGGTACCAAC 
AACCAATCAC 
ACTGCAACTA 
ACCTGTCTAC 
ATGCTCTTCA 
GTCGGTCTAG 
TCCAAGCATG 
ATAAACAATA 
TTTGCCAACG 
CTTCAGTGAC 
TGAAAGAAAC 
GCATACGGAA 
TAAGACAGGT 
AGAACACTGG 
AAGTATTCTA 
CGTTGGAGAT 
CGTATCTGTC 
CTCTACAGAA 
GAGCTCACCT 
CATCAGATAG 



CTGTGGGATA 
TTTGCAAGTC 
CCCAACTTGG 
CAAGCTGTGG 
CGATTATGCA 
TGGTTAATGA 
AACTGGGATA 
ACAATCACGA 
ATAGAGCTAA 
CATTATGCAA 
CGGAGCAAGT 
GTGGAATTTA 
GGTAGTAAAA 
AACTGCTTAC 
CTGGTCGTGG 
ACTTCTAACT 
CTATGTAGCT 
TGGCTGTATG 
GGTTTCCTAG 
TGAAGGAAGC 
ATGGAGAATT 
GCTCCACAAC 
TTCAACTTCA 



ID NO: 1006 




TCTACAACTC 


CGATCAATAC 


50 


GCATCTACGG 


TCGTAGATGT 


100 


AGCTCGTCAC 


CAAGCAAGTA 


150 


AAACCAATCG 


TGACTGGGGT 


200 


CCTGCCATAG 


AATACGGTGT 


250 


TATTCCTTAT 


AACTATCCGG 


300 


GAGCATATTT 


CGGTAATATT 


350 


AATGTCACAG 


CCGTTGAGAC 


400 


AACCTTCCTT 


AATGGTCTTG 


450 


ACGCCATTTC 


AAGTAATACA 


500 


AGTGAAAAAA 


TGGCTGCTGC 


550 


CCACAAACCC 


ATGTATATCA 


600 


AAGAATTTTC 


AGATGTAGGT 


650 


ATGATGACCG 


AAATGATGAA 


700 


AGCCTATCTC 


CCATGGTTAG 


750 


ACACAGATGA 


TGAAATTGAA 


800 


CCAGATGAAA 


TGTTTGTTGG 


850 


GACAGGTTAT 


TCGAATCGTT 


900 


TTGCAGCTAA 


AGTTTATCGC 


950 


AATCCAGAGG 


ATTGGAATAT 


1000 


CGTATTTAAA 


AATGGTGCTC 


1050 


AACCCCCATC 


AACTGAAAGT 


1100 


CAGTCTAACT 


CAACCACTCC 


1150 
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AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CCC 1213 



2) INFORMATION FOR SEQ ID NO: 1007 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: S.trR-04 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 



ACCAAGAAGC 
GTCTCTTACC 
TTCAAATGGT 

25 ACGTTTCATT 
TCTGCTATGA 
TTATGATTCC 
GAACAAGCAC 
ACTCTGCAAT 

30 TTTGAATAAG 
GTATCGACTA 
ACAGAATCTA 
TTATGCTGCC 
ATAAAGTCGT 

35 ACTCGTGCCA 
AACAGTCTTG 
CTCAAGCTGG 
AACCACATCA 
TTATACTCGT 

40 TAACTCCTAT 
TCAATGATAA 
GCCAGACGGA 
GCCCAATATG 
TCAAGCTCAT 

4 5 AAGCACAAAT 
CAAATACAAC 



TCAAAAACAT 
CTGACGATGA 
AAAGTCATCG 
TGGTACCAAC 
AACCAATCAC 
ACTGCAACTA 
ACCTGTCTAC 
ATGCTCTTCA 
GTCGGTCTAG 
TCCAAGCATG 
ATAAACAATA 
TTTGCAAATG 
CTTCAGTGAT 
TGAAGGAAAC 
ACTTATGGAA 
TAAAACAGGA 
AGAACACTGG 
AAGTATTCTA 
CGTTGGAGAT 
CGTATCTATC 
CTTTTCAGAA 
GACTGAACCC 
CATCAGATAG 
AATAGTACGA 
CCCTGATC 



CTGTGGGATA 
TTTGCAAGTC 
CCCAACTTGG 
CAAGCTGTGG 
CGATTATGCA 
TGGTTAATGA 
AACTGGGATA 
ACAATCACGA 
ATAGAGCTAA 
CATTATGCAA 
CGGAGCAAGT 
GTGGCACTTA 
GGAAGTAAAA 
GACAGCCTAT 
CTGGGCGTGG 
ACCTCTAACT 
CTATGTAGCT 
TGGCTGTATG 
GGTTTCCTAG 
AGAAGATACT 
ACGGGGAATT 
TCTACTCAAC 
TTCAACTTCA 
CTACCAATCC 



TCTACAACTC 
GCATCTACGG 
AGCTCGTCAC 
AAACCAATCG 
CCTGCCATAG 
TATTCCTTAT 
GAGCATATTT 
AATGTCACAG 
AACCTTCCTT 
ACGCCATTTC 
AGTGAAAAAA 
CTATAAACCA 
AAGAGTTCTC 
ATGATGACCG 
AGCCTATCTT 
ATACAGATGA 
CCAGATGAAA 
GACAGGTTAT 
TTGCAGCTAA 
CATCCAGAAG 
TGTATTCAAA 
AATCCTCAAC 
CAGTCTAGCT 
TAACAATAAT 



CGATCAATAC 50 

TCGTAGATGT 100 

CAAGCAAGTA 150 

TGACTGGGGT 2 00 

AATACGGTGT 250 

AACTATCCGG 300 

CGGTAATATT 350 

CCGTTGAGAC 4 00 

AATGGTCTTG 450 

AAGTAATACA 500 

TGGCTGCTGC 550 

ATGTATATCC 600 

TAATGTCGGA 650 

ACATGATGAA 700 

CCTTGGCTTC 750 

GGAAGTTGAA 800 

TGTTTGTTGG 850 

TCGAATCGTT 900 

AGTTTATCGC 950 

ACTGGACGAT 1000 

AATGGAGCTC 1050 

AGCTGAAAGT 1100 

CAACCACTCC 1150 

ACGCAACAAT 1200 
1218 



50 2) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 200 

GGGTTCTGCT ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT 300 

CCGGGAACAA GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA 350 

TATTACTCTG CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 400 

AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 4 50 

CTTGGTATCG ACTATCCAAG CATGCATTAT, GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 650 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 700 

TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 750 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT 800 

TGAAAACCAC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

TCGCTCAATG ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

CAATCAAATA CAACCCCTGA TCA 1223 



2) INFORMATION FOR SEQ ID NO: 1009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
<B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 3 50 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 4 00 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 
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TTATGCTGCC TTTGCAAATG GTGGCACTTA 
ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA 
ACTCGTGCCA TGAAGGAAAC GACAGCCTAT 
AACAGTCTTG ACTTATGGAA CTGGGCGTGG 
5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT 
AACCACATCA AGAACACTGG CTATGTAGCT 
TTATACTCGT AAGTATTCTA TGGCTGTATG 
TAACTCCTAT CGTTGGAGAT GGTTTCCTAG 
TCAATGATAA CGTATCTATC AGAAGATACT 
10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT 
GCCCAATATG GACTGAACCC TCTACTCAAC 
TCAAGCTCAT CATCAGATAG TTCAACTTCA 
AAGCACAAAT AATAGTACGA CTACCAATCC 
CAAATACAAC CCCT 

15 



PCT/CA00/01150 

CTATAAACCA ATGTATATCC 600 

AAGAGTTCTC TAATGTCGGA 650 

ATGATGACCG ACATGATGAA 700 

AGCCTATCTT CCTTGGCTTC 750 

ATACAGATGA GGAAGTTGAA 800 

CCAGATGAAA TGTTTGTTGG 850 

GACAGGTTAT TCGAATCGTT 900 

TTGCAGCTAA AGTTTATCGC 950 

CATCCAGAAG ACTGGACGAT 1000 

TGTATTCAAA AATGGAGCTC 1050 

AATCCTCAAC AGCTGAAAGT 1100 

CAGTCTAGCT CAACCACTCC 1150 

TAACAATAAT ACGCAACAAT 1200 

1214 



2) INFORMATION FOR SEQ ID NO: 1010 



20 



25 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 



CCAAGAAGCT 

3 5 TCTCTTACCC 

TCAAATGGTA 
TGTTTCATTC 
CATCAATGAA 
TATGACTCTA 

4 0 CACTGATACT 

CAATCCAGTA 
TTGAATAAGG 
TATCGACTAT 
CTGAATCCAA 

45 TACGCTGCTT 
TAAAATCGTC 
CACGAGCTAT 
ACTGTTTTAA 
ACAAGCAGGT 

50 AGTATATCAA 
TATACCCGTA 
AACTCCAATC 
CAATGATAAC 
CCAGATGGCT 

55 TTCTACGTGG 
CAAGCTCATC 
AGCACAAATA 
AAATACAACC 



CAAAAACATC 
TGACGATGAT 
AAGTCATCGC 
GGTACCAACC 
ACCAATCACT 
CTGCTTCTAT 
CCACTCTACA 
TGCTCTTCAA 
TCGGTCTAGA 
CCAAGCATGC 
CAAAAAATAT 
TTGCTAATGG 
TTTAGTGATG 
GAAAGAGACT 
CTTACGGAAC 
AAGACAGGTA 
GAACACTGGT 
AATATGCAAT 
ATCGGAGATG 
TTACCTTTCT 
TGTATAGAAA 
AGCTCACCTG 
ATCAGATAGT 
ATAGTACGAC 
CCTGATCAAC 



TGTGGGATAT 
TTGCAAGTCG 
ACAACTTGGT 
AGGCCGTAGA 
GACTATGCTC 
TGTACATGAT 
ACTGGGATCA 
CAATCACGAA 
TAGAGCTAAA 
ATTATGCAAA 
GGTGCAAGTA 
TGGTATTTAT 
GTAGCGAAAA 
ACTGCCTATA 
AGGACGTGGA 
CTTCTAACTA 
TACGTAGCTC 
GGCTGTTTGG 
GTTTCCTTGT 
GAAGATGACC 
TGGAGAATTC 
CTCCACAACA 
TCAACTTCAC 
TACCAATCCT 
AAA 



CTACAACTCC 
CATCTACGGT 
GCTCGTCATC 
AACCAATCGT 
CCGCTTTAGA 
GTCCCTTATA 
TGTCTACTTT 
ATGTCACAGC 
ACCTTCCTTA 
CGCCATTTCA 
GTGAAAAAAT 
CACAAACCAA 
AGAATTTTCT 
TGATGACTGA 
GCCTACCTAC 
TACTGACGAA 
CAGATGAAAT 
ACAGGATACT 
TGCTGGTAAA 
AACCTGGAGA 
GTATTTAAAA 
ACCCCCATCA 
AGTCTAACTC 
AACAATAATA 



GATCAATACG 50 

CGTAGATGTT 100 

AAGCAAGTAA 15 0 

GACTGGGGAT 200 

ATATGGAGTC 250 

ACTATCCTGG 300 

GGAAACATTA 350 

CGTTGAGACT 4 00 

ATGGTCTTGG 4 50 

AGTAACACAA 500 

GGCTGCTGCC 550 

TGTATATCAA 600 

GATGCTGGTA 650 

AATGATGAAA 700 

CATGGCTTCC 750 

GAAATTGAAA 800 

GTTTGTAGGG 850 

CAAATCGTCT 900 

GTCTATCGTT 950 

TTGGACAATG 1000 

ATGGTGCTCG 1050 

ACTGAAAGTT 1100 

AACCACTCCA 1150 

CGCAACAATC 1200 
1223 
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2) INFORMATION FOR SEQ ID NO: 1011 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 07 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1011 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT CGGAACTCGT 650 

GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA TGAAAACAGT 700 

CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT TGAAAACCAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGCCCAA 1050 

TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCC 1207 



2 ) INFORMATION FOR SEQ ID NO: 1012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 
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WO 01/23604 



PCT/CA00/01150 



AAGCTCAAAA 
TACCCTGACG 
TGGTAAAGTC 
5 CATTTGGTAC 
ATGAAACCAA 
TTCCACTGCA 
GCACACCTGT 
CAATATGCTC 

10 TAAGGTCGGT 
ACTATCCAAG 
TCTAATAAAC 
TGCCTTTGCC 
TCGTCTTCAG 

15 GCTATGAAAG 
CTTGGCATAC 
CTGGTAAGAC 
ATCAAGAACA 
TCGTAAGTAT 

20 CTATCGTTGG 
ATAACGTATC 
CGGACTTTTC 
CGTGGAACTC 
TCATCATCAG 

25 AAATAATAGT 
C 



ACATCTGTGG 
ATGATTTGCA 
ATCGCCCAAC 
CAACCAAGCT 
TCACCGATTA 
ACTATGGTTA 
CTACAACTGG 
TTCAACAATC 
CTAGATAGAG 
CATGCATTAT 
AATACGGAGC 
AACGGTGGAA 
TGACGGTAGT 
AAACAACTGC 
GGAACTGGTC 
AGGTACTTCT 
CTGGCTATGT, 
TCTATGGCTG 
AGATGGTTTC 
TATCAGAAGA 
AGAAACGGGG 
ACCTGCTCCA 
ATAGTTCAAC 
ACGACTACCG 



GATATCTACA 
AGTCGCATCT 
TTGGAGCTCG 
GTGGAAACCA 
TGCACCTGCC 
ATGATATTCC 
GATAGAGCAT 
ACGAAATGTC 
CTAAAACCTT 
GCAAACGCCA 
AAGTAGTGAA 
TTTACCACAA 
AAAAAAGAAT 
TTACATGATG 
GTGGAGCCTA 
AACTACACAG 
AGCTCCAGAT 
TATGGACAGG 
CTAGTTGCAG 
TACTCATCCA 
AATTTGTATT 
CAACAACCCC 
TTCACAGTCT 
ATCCTAACAA 



ACTCCGATCA 
ACGGTCGTAG 
TCACCAAGCA 
ATCGTGACTG 
ATAGAATACG 
TTATAACTAT 
ATTTCGGTAA 
ACAGCCGTTG 
CCTTAATGGT 
TTTCAAGTAA 
AAAATGGCTG 
ACCCATGTAT 
TTTCAGATGT 
ACCGAAATGA 
TCTCCCATGG 
ATGATGAAAT 
GAAATGTTTG 
TTATTCGAAT 
CTAAAGTTTA 
GAAGACTGGA 
CAAAAATGGA 
CATCAACTGA 
AGCTCAACCA 
TAATACGCAA 



ATACGTCTCT 50 

ATGTTTCAAA 100 

AGTAACGTTT 150 

GGGTTCTGCT 200 

GTGTTTATGA 250 

CCGGGAACAA 300 

TATTACTCTG 350 

AGACTTTGAA 4 00 

CTTGGTATCG 450 

TACAACAGAA 500 

CTGCTTATGC 550 

ATCAATAAGG 600 

AGGTACACGA 650 

TGAAAACTGT 700 

TTAGCGCAAG 750 

TGAAAAACAC 800 

TTGGTTATAC 850 

CGTTTAACTC 900 

TCGCTCAATG 950 

CGATGCCAGA 1000 

GCTCGTTCTA 1050 

AAGTTCAAGC 1100 

CTCCAAGCAC 1150 

CAATCAAATA 1200 
1201 



30 2) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1220 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC AACTCCGATC 50 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC TACGGTCGTA 100 

GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC GTCATCAAGC 150 

AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC AATCGTGACT 2 00 

50 GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC TTTAGAATAT 250 

GGAGTCTATG ACTCTACTGC TTCTATTGTA CATGATGTCC CTTATAACTA 300 

TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC TACTTTGGAA 350 

ACATTACAAT CCAGTATGCT CTTCAACAAT CACGAAATGT CACAGCCGTT 4 00 

GAGACTTTGA ATAAGGTCGG TCTAGATAGA GCTAAAACCT TCCTTAATGG 4 50 

55- TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA 500 

ACACAACTGA ATCCAACAAA AAATATGGTG CAAGTAGTGA AAAAATGGCT 550 

GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA AACCAATGTA 600 

TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA TTTTCTGATG 650 

CTGGTACACG AGCTATGAAA GAGACTACTG CCTATATGAT GACTGAAATG 700 

60 ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT ACCTACCATG 750 

563 
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WO 01/23604 

GCTTCCACAA GCAGGTAAGA CAGGTACTTC 
TTGAAAAGTA TATCAAGAAC ACTGGCTACG 
GTGGGTTATA CTCGTAAGTA TTCTATGGCT 
TCGTTTAACT CCTATCGTTG GAGATGGTTT 
5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG 
AATATACCAG AGGGGCTCTA CAGAAATGGA 
TGCTCGTTCT ACGTGGAGCT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
10 ACAATCAAAT ACAACCCCTG 



PCT/CA00/01150 

TAACTATACT GACGAAGAAA 800 

TAGCTCCAGA TGAAATGTTT 850 

GTATGGACAG GTTATTCGAA 900 

CCTAGTTGCA GCTAAAGTTT 950 

GAAGCAATCC AGAGGATTGG 1000 

GAATTCGTAT TTAAAAATGG 1050 

ACAACAACCC CCATCAACTG 1100 

CTTCACAGTC TAGCTCAACC 1150 

AATCCTAACA ATAATACGCA 1200 

1220 



2) INFORMATION FOR SEQ ID NO: 1014 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

30 CAAAAACATC TGTGGGATAT TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TCTAACGGTA 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCCTTC 150 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 2 00 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 250 

35 CTGCTACTAT CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACC 300 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 350 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 4 00 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 450 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

4 0 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AGAGTTCTCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA. CATGATGAAA ACTGTCTTAG 700 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 750 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA CCAGAGGGGC 1000 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 1050 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA 1150 

ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



55 



2) INFORMATION FOR SEQ ID NO: 1015 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 

564 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 

(B) 
(C) 
(D) 



PCT/CA00/01150 



TYPE: Nucleic, acid 
STRANDEDNESS: Double 
TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015 



AAGCTCAAAA 
TACCCTGACG 

15 TGGTAAAGTC 
CATTTGGTAC 
ATGAAACCAA 
TTCCACTGCA 
GCACACCTGT 

20 CAATATGCTC 
TAAGGTCGGT 
ACTATCCAAG 
TCTAATAAAC 
TGCCTTTGCC 

25 TCGTCTTCAG 
GCTATGAAAG 
CTTGGCATAC 
CTGGTAAGAC 
ATCAAGAACA 

30 TCGTAAGTAT 
CTATCGTTGG 
ATAACGTATC 
GGGGCTCTAC 
CGTGGAGCTC 

35 TCATCATCAG 
AAATAATAGT 
CAACCCCTGA 



ACATCTGTGG 
ATGATTTGCA 
ATCGCCCAAC 
CAACCAAGCT 
TCACCGATTA 
ACTATGGTTA 
CTACAACTGG 
TTCAACAATC 
CTAGATAGAG 
CATGCATTAT 
AATACGGAGC 
AACGGTGGAA 
TGACGGTAGT 
AAACAACTGC 
GGAACTGGTC 
AGGTACTTCT 
CTGGCTATGT 
TCTATGGCTG 
AGATGGTTTC 
TGTCTGAAGG 
AGAAATGGAG 
ACCTGCTCCA 
ATAGTTCAAC 
ACGACTACCA 
T 



GATATCTACA 
AGTCGCATCT 
TTGGAGCTCG 
GTGGAAACCA 
TGCACCTGCC 
ATGATATTCC 
GATAGAGCAT 
ACGAAATGTC 
CTAAAACCTT 
GCAAACGCCA 
AAGTAGTGAA 
TTTACCACAA 
AAAAAAGAAT 
TTACATGATG 
GTGGAGCCTA 
AACTACACAG 
AGCTCCAGAT 
TATGGACAGG 
CTAGTTGCAG 
AAGCAATCCA 
AATTCGTATT 
CAACAACCCC 
TTCACAGTCT 
ATCCTAACAA 



ACTCCGATCA 
ACGGTCGTAG 
TCACCAAGCA 
ATCGTGACTG 
ATAGAATACG 
TTATAACTAT 
ATTTCGGTAA 
ACAGCCGTTG 
CCTTAATGGT 
TTTCAAGTAA 
AAAATGGCTG 
ACCCATGTAT 
TTTCAGATGT 
ACCGAAATGA 
TCTCCCATGG 
ATGATGAAAT 
GAAATGTTTG 
TTATTCGAAT 
CTAAAGTTTA 
GAGGATTGGA 
TAAAAATGGT 
CATCAACTGA 
AACTCAACCA 
TAATACGCAA 



ATACGTCTCT 50 

ATGTTTCAAA 100 

AGTAACGTTT 150 

GGGTTCTGCT 200 

GTGTTTATGA 250 

CCGGGAACAA 300 

TATTACTCTG 350 

AGACTTTGAA 4 00 

CTTGGTATCG 450 

TACAACAGAA 500 

CTGCTTATGC 550 

ATCAATAAGG 600 

AGGTACACGA 650 

TGAAAACTGT 700 

TTAGCGCAAG 750 

TGAAAAACAC 800 

TTGGTTATAC 850 

CGTTTAACTC 900 

TCGCTCAATG 950 

ATATACCAGA 1000 

GCTCGTTCTA 1050 

AAGTTCAAGC 1100 

CTCCAAGCAC 1150 

CAATCAAATA 12 00 
1211 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1016 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13. 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

60 AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 2 00 

565 



SUBSTITUTE SHEET (RULE 26) 
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GGGATCATCA ATGAAACCAA TCACTGACTA 

GAGTCTATGA CTCTACTGCT TCTATTGTAC 

CCTGGCACTG ATACTCCACT CTACAACTGG 

CATTACAATC CAGTATGCTC TTCAACAATC 

5 AGACTTTGAA TAAGGTCGGT CTAGATAGAG 

CTTGGTATCG ACTATCCAAG CATGCATTAT 

CACAACTGAA TCCAACAAAA AATATGGTGC 

CTGCCTACGC TGCTTTTGCT AATGGTGGTA 

ATCAATAAAA TCGTCTTTAG TGATGGTAGC 

10 TGGTACACGA GCTATGAAAG AGACTACTGC 

TGAAAACTGT TTTAACTTAC GGAACAGGAC 

CTTCCACAAG CAGGTAAGAC AGGTACTTCT 

TGAAAAGTAT ATCAAGAACA CTGGCTACGT 

TGGGTTATAC TCGTAAGTAT TCTATGGCTG 

15 CGTTTAACTC CTATCGTTGG AGATGGTTTC 

TCGCTCTATG ATGACCTACC TGTCTGAAGG 
ATATACCAGA GGGGCTCTAC AGAAATGGAG 

GCTCGTTCTA CGTGGAGCTC ACCTGCTCCA 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC 

20 CTCCAAGCAC AAATAATAGT ACGACTACCA 
CAATCAAATA CAACCCCTGA TC 



PCT/CAOO/01150 

TGCTCCCGCT TTAGAATATG 250 

ATGATGTCCC TTATAACTAT 300 

GATCATGTCT ACTTTGGAAA 3 50 

ACGAAATGTC ACAGCCGTTG 4 00 

CTAAAACCTT CCTTAATGGT 450 

GCAAACGCCA TTTCAAGTAA 500 

AAGTAGTGAA AAAATGGCTG 550 

TTTATCACAA ACCAATGTAT 600 

GAAAAAGAAT TTTCTGATGC 650 

CTATATGATG ACTGAAATGA 7 00 

GTGGAGCCTA CCTACCATGG 750 

AACTATACTG ACGAAGAAAT 800 

AGCTCCAGAT GAAATGTTTG 850 

TATGGACAGG TTATTCGAAT 900 

CTAGTTGCAG CTAAAGTTTA 950 

AAGCAATCCA GAGGATTGGA 1000 

AATTCGTATT TAAAAATGGT 1050 

CAACAACCCC CATCAACTGA 1100 

TTCACAGTCT AGCTCAACCA 1150 

ATCCTAACAA TAATACGCAA 1200 

1222 



25 2) INFORMATION FOR SEQ ID NO: 1017 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG 50 

AATACGTTGC CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT 100 

GATGTTTCTA ACGGTAAAGT CATTGCCCAG CTAGGAGCAC GCCATCAGTC 150 

AAGTAATGTT TCCTTCGGAA TTAACCAAGC AGTAGAAACA AACCGCGACT 200 

45 GGGGATCAAC TATGAAACCG ATCACAGACT ATGCTCCTGC CTTGGAGTAC 250 

GGTGTCTACG ATTCAACTGC TACTATCGTT CACGATGAGC CCTATAACTA 3 00 

CCCTGGGACA AATACCCCTG TTTATAACTG GGATAGGGGC TACTTTGGCA 3 50 

ACATCACCTT GCAATACGCC CTGCAACAAT CGCGAAACGT CCCAGCCGTG 4 00 

GAAACTCTAA ACAAGGTCGG ACTCAACCGC GCCAAGACTT TCCTAAATGG 4 50 

50 TCTCGGAATC GACTACCCAA GTATTCACTA CTCAAATGCC ATTTCAAGTA 500 

ACACAACCGA ATCAGACAAA AAATATGGAG CAAGTAGTGA AAAGATGGCT 550 

GCTGCTTACG CTGCCTTTGC AAATGGTGGA ACTTACTATA AACCAATGTA 600 

TATCCATAAA GTCGTCTTTA GTGATGGGAG TGAAAAAGAG TTCTCTAATG 650 

TCGGAACTCG TGCCATGAAA GAAACAACTG CTTACATGAT GACCGAAATG 700 

55 ATGAAAACTG TCCTGGCATA CGGAAGTGGT CGTGGAGCCT ATCTCCCATG 750 

GTTAGCGCAA GCTGGTAAGA CAGGTACTTC TAACTACACA GATGATGAAA 800 

TTGAAAAACA CATCAAGAAC ACTGGCTATG TAGCTCCAGA TGAAATGTTT 850 

GTTGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

60 ATCGCTCAAT GATAACGTAT CTATCAAAAG ATACTCATCC AGAAGACTGG 1000 

566 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ACGATGCCAG ACGGACTTTT CAGAAACGGG 
AGCTCGTTCT ACGTGGAACT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
5 ACAATCAAAT ACAACCCCTG ATCAACAAA 



PCT/CAOO/01150 
GAATTTGTAT TCAAAAATGG 1050 
ACAACAACCC CCATCAACTG 1100 
CTTCACAGTC TAACTCAACC 1150 
AATCCTAACA ATAATACGCA 1200 

1229 



2) INFORMATION FOR SEQ ID NO: 1018 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear' 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018 

25 ATGTAGACCA AGAGGCTCAA AAACGTCTGT GGGATATCTA CAACTCCGAT 50 

CAATACGTCT CTTACCCTGA CGATGATTTG CAAGTCGCAT CTACGGTCGT 100 

AGATGTTTCA AATGGTAAAG TCATCGCCCA ACTTGGAGCT CGTCACCAAG 150 

CAAGTAACGT TTCATTTGGT ACCAACCAAG CTGTGGAAAC CAACCGTGAC 200 

TGGGGATCAA GCATGAAACC AATCACTGAT TATGCCCCAG CCTTAGAATA 250 

30 TGGTGTATAT GATTCCACTG CAACTATGGT TAATGATATT CCTTATAACT 300 

ATCCGGGCAC AAGCACACCT GTCTACAACT GGGATCGAGC ATATTTTGGT 350 

AATATTAGCC TGCAATATGC CCTTCAACAA TCTCGTAACG TGCCTGCCGT 4 00 

TGAAACACTA AACAAGGTTG GTTTAGATAG AGCCAAAACT TTCCTAAATG 450 

GTTTGGGAAT TGACTATCCA AGTATTCACT ACTCAAATGC TATTTCAAGT 50 0 

35 AATACAACTG AATCTAGTAA ACAGTACGGG GCAAGCAGTG AGAAAATGGC 550 

TGCGGCTTAC GCTGCATTCG CTAATGGCGG TATTTACCAC AAACCAATGT 600 

ACATCAATAA AGTTGTCTTT AGTGATGGTA GCGAAAAAGA ATTTTCTGAT 650 

GCTGGTACAC GAGCTATGAA AGAGACTACT GCCTATATGA TGACTGAAAT 700 

GATGAAAACT GTTTTAACTT ACGGAACAGG ACGTGGAGCC TACCTACCAT 750 

40 GGCTTCCACA AGCAGGTAAG ACAGGTACTT CTAACTATAC TGACGACGAA 800 

ATTGAAAAGT ATATCAAGAA CACTGGCTAC GTAGCTCCAG ATGAAATGTT 850 

TGTGGGTTAT ACTCGTAAGT ATTCTATGGC TGTATGGACT GGATACTCAA 900 

ATCGTTTAAC TCCAATCATT GGAGATGGTT TCCTAGTTGC TGCCAAAGTT 950 

TATCGCTCAA TGATATCGTA TCTATCAGAA GATGACCATC CTGGAGATTG 1000 

4 5 GACAATGCCT GAGGGAGTAT ACAGAAGTGG AGAATTCGTA TTTAAAAATG 1050 

GTGCTCGTTC TACGTGGAGC TCCCCTGCTC CACAACAACC CCCATCAACT 1100 

GAAAGTTCAA GCTCATCATC AGATAGTTCA ACTTCACAGT CTAGCTTAAC 1150 

CACTCCAAGC ACAAATAATA GTACGACTAC CAATCCTAAC AACAATACGC 1200 

AACAATCAAA TACAACCCCT GATCA 1225 

50 



2) INFORMATION- FOR SEQ ID NO: 1019 

55 ■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1439 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

567 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 



GCCTCTATTT 

10 TCGAAAGGTT 
GTGAAAAAGC 
GGCTATTCTC 
TGAAGAGACC 
AATATGGCAA 

15 AACAATATCA 
TTTGCTGAAA 
ATTCTGAAGG 
TTATCCATGT 
TGATTCCTTG 

20 AGGCTGCGAC 
CAAACCTTAA 
TTCTTGGTAT 
CCTTGGAGTA 
ATGGGCCAGA 

25 AACAGCTATG 
CTGCGACCGG 
GAGTATAGCT 
CTATACGCCC 
GTGTTCGTGT 

30 AAGGGAGGAC 
TAAGGTCAAT 
ATCAGGTTGC 
AATGGTGCCT 
TGTGGCAGAT 

35 CATCTGATAA 
AATCTAACAA 
GTATCAAAAA 



CAAAGGAGAT 
TTGGAAACTT 
TGGTCTCCCA 
TAAATGACCG 
TTACAAGGAA 
TATGGAAAGC 
AACTGACCAT 
AGTTATTTCA 
TGTCTATGCA 
CAGGGATCAA 
GGAACGGTAA 
CATCAGCTCA 
CAGATCAGCC 
AAATTGGCAT 
TTCATCCAAT 
CCTATCAACC 
GGAAAACTTC 
AATTGACCTA 
TTGCTAATTA 
ATGCAGTTGG 
GGCTCCTCGT 
TGGGTGACTT 
ATATCCGACT 
CCATGGTACT 
TGGTATCCAT 
GGTCAGCAAG 
TCCCCAAATC 
ATGGTGTAGG 
TACCATCCAA 



GCCTGGCATT 
CCCTTTCTTC 
GCGGAAGAAG 
TGTTGGAACC 
AACGCTCGGT 
GTGGACACAA 
TGATTTGGCC 
ATTCCGAGCT 
GTCGCCCTTA 
ACATGACCTG 
CCAATGTCTT 
GGTTGGGAAA 
TATTGTTTTC 
ATGGATCTTT 
GCTTACATGG 
AAATATGTTT 
GTGCGACCTT 
CCAGATGAAT 
CATCACCAAT 
CTCAGTATGT 
ATTGTTGAAG 
GATTCAGCAA 
CCGATATGAG 
AGTGGATTGA 
TAGCGGAAAA 
CAACCAATAC 
GCTGTCGCAG 
ACCTTCCATT 
TGAACTAGAA 



AGTATTTCTA 
TATAGTAGGG 
CAGAATCCTA 
TCCTATTTGG 
AAAAGAAATC 
TTGAGGAAGG 
TTCCAAGATA 
AGGAAATGGT 
ACCCAAAAAC 
AAAACGGGAG 
TGTCCCAGGT 
ATGGTGTTTT 
CAAGGTTCAG 
TCCTATTACA 
TTCAAACCGC 
GTTGGAACCA 
TGGCGAATAT 
CTACTGGATT 
TCCTTTGGGC 
AGCAACTATT 
GCATTTATGG 
CTGCAACCGA 
CATCTTGCAC 
CAACTGGACG 
ACAGGTACAG 
CAATGCGGTG 
TGGTCTTTCC 
GCGCGTGACA 
AGGAAATTA 



CTTCTTGGGA 50 

AGTGTATCCA 100 

TCTTAAAAAA 150 

AAAAGCAATA 2 00 

CATCTGGATA 250 

TAGTAAGGGA 3 00 

GCGTGGATGC 350 

GGAGCTAAAT 400 

AGGTGCTGTT 450 

AGTTGACTCC 500 

TCGGTTGTCA 550 

ATCAGGAAAC 600 

CTCCAATTTA 650 

GCTGTGGAAG 700 

TCTTGGAATC 750 

GCAATTTGGA 800 

GGCTTGGGGG 850 

TGTTCCCAAA 900 

AGTTTGATAA 950 

GCAAATAATG 1000 

TAATAATGAT 1050 

CAGAGATGAA 1100 

CAAGGTTTTT 1150 

TGCCTTTTCA 1200 

CCGAAAGCTA 1250 

GCCTATGCCC 13 00 

TCATAATACC 13 50 

TTATCAATCT 14 00 
1439 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 

568 
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TGAAGAGACC 
AATATGGCAA 
' AACAATATCA 
TTTGCTGAAA 
5 ATTCTGAAGG 
TTATCCATGT 
TGATTCCTTG 
AGGCTGCGAC 
CAAACCTTAA 

10 TTCTTGGTAT 
CCTTGGAGTA 
ATGGGCCAGA 
AACAGCTATG 
CTGCGACCGG 

15 GAGTATAGCT 
CTATACGCCC 
GTGTTCGTGT 
AAGGGAGGAC 
TAAGGTCAAT 

20 ATCAGGTTGC 
AATGGTGCCT 
TGTGGCAGAT 
CATCTGATAA 
AATCTAACAA 

25 GTATCAAAAA 



TTACAAGGAA 
TATGGAAAGC 
AACTGACCAT 
AGTTATTTCA 
TGTCTATGCA 
CAGGGATCAA 
GGAACGGTAA 
CATCAGCTCA 
CAGATCAGCC 
AAATTGGCAT 
TTCATCCAAT 
CCTATCAACC 
GGAAAACTTC 
AATTGACCTA 
TTGCTAATTA 
ATGCAGTTGG 
GGCTCCTCGT 
TGGGTGACTT, 
ATATCCGACT 
CCATGGTACT 
TGGTATCCAT 
GGTCAGCAAG 
TCCCCAAATC 
ATGGTGTAGG 
TACCATCCAA 



AACGCTCGGT 
GTGGACACAA 
TGATTTGGCC 
ATTCCGAGCT 
GTCGCCCTTA 
ACATGACCTG 
CCAATGTCTT 
GGTTGGGAAA 
TATTGTTTTC 
ATGGATCTTT 
GCTTACATGG 
AAATATGTTT 
GTGCGACCTT 
CCAGATGAAT 
CATCACCAAT 
CTCAGTATGT 
ATTGTTGAAG 
GATTCAGCAA 
CCGATATGAG 
AGTGGATTGA 
TAGCGGAAAA 
CAACCAATAC 
GCTGTCGCAG 
ACCTTCCATT 
TGAACTAGAA 



AAAAGAAATC 
TTGAGGAAGG 
TTCCAAGATA 
AGGAAATGGT 
ACCCAAAAAC 
AAAACGGGAG 
TGTCCCAGGT 
ATGGTGTTTT 
CAAGGTTCAG 
TCCTATTACA 
TTCAAACCGC 
GTTGGAACCA 
TGGCGAATAT 
CTACTGGATT 
TCCTTTGGGC 
AGCAACTATT 
GCATTTATGG 
CTGCAACCGA 
CATCTTGCAC 
CAACTGGACG 
ACAGGTACAG 
CAATGCGGTG 
TGGTCTTTCC 
GCGCGTGACA 
AGGAAATTAT 



PCT/CA00/01150 

CATCTGGATA 250 

TAGTAAGGGA 300 

GCGTGGATGC 350 

GGAGCTAAAT 4 00 

AGGTGCTGTT 450 

AGTTGACTCC 500 

TCGGTTGTCA 550 

ATCAGGAAAC 600 

CTCCAATTTA 650 

GCTGTGGAAG 700 

TCTTGGAATC 750 

GCAATTTGGA 800 

GGCTTGGGGG 850 

TGTTCCCAAA 900 

AGTTTGATAA 950 

GCAAATAATG 1000 

TAATAATGAT 1050 

CAGAGATGAA 1100 

CAAGGTTTTT 1150 

TGCCTTTTCA 1200 

CCGAAAGCTA 1250 

GCCTATGCCC 1300 

TCATAATACC 1350 

TTATCAATCT 1400 

G 1441 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1396 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 



(xi) SEQUENCE DESCRIPTION: 



45 TATTGCCTCT 
GGGATCGAAA 
TCCAGTGAAA 
AAAAGGCTAT 
AATATGAAGA 

50 GATAAATATG 
GGGAAACAAT 
ATGCTTTGCT 
AAATATTCTG 
TGTTTTATCC 

55 CTCCTGATTC 
GTCAAGGCTG 
AAACCAAACC 
TTTATTCTTG 
GAAGCCTTGG 

60 AATCATGGGC 



ATTTCAAAGG 
GGTTTTGGAA 
AAGCTGGTCT 
TCTCTAAATG 
GACCTTACAA 
GCAATATGGA 
ATCAAACTGA 
GAAAAGTTAT 
AAGGTGTCTA 
ATGTCAGGGA 
CTTGGGAACG 
CGACCATCAG 
TTAACAGATC 
GTATAAATTG 
AGTATTCATC 
CAGACCTATC 



AGATGCCTGG 
ACTTCCCTTT 
CCCAGCGGAA 
ACCGTGTTGG 
GGAAAACGCT 
AAGCGTGGAC 
CCATTGATTT 
TTCAATTCCG 
TGCAGTCGCC 
TCAAACATGA 
GTAACCAATG 
CTCAGGTTGG 
AGCCTATTGT 
GCATATGGAT 
CAATGCTTAC 
AACCAAATAT 



ID NO: 1021 




CATTAGTATT 


TCTACTTCTT 


50 


CTTCTATAGT 


AGGGAGTGTA 


100 


GAAGCAGAAT 


CCTATCTTAA 


150 


AACCTCCTAT 


TTGGAAAAGC 


200 


CGGTAAAAGA 


AATCCATCTG 


250 


ACAATTGAGG 


AAGGTAGTAA 


300 


GGCCTTCCAA 


GATAGCGTGG 


350 


AGCTAGGAAA 


TGGTGGAGCT 


400 


CTTAACCCAA 


AAACAGGTGC 


450 


CCTGAAAACG 


GGAGAGTTGA 


500 


TCTTTGTCCC 


AGGTTCGGTT 


550 


GAAAATGGTG 


TTTTATCAGG 


600 


TTTCCAAGGT 


TCAGCTCCAA 


650 


CTTTTCCTAT 


TACAGCTGTG 


700 


ATGGTTCAAA 


CCGCTCTTGG 


750 


GTTTGTTGGA 


ACCAGCAATT 


800 
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TGGAAACAGC TATGGGAAAA 
GGGGCTGCGA CCGGAATTGA 
CAAAGAGTAT AGCTTTGCTA 
ATAACTATAC GCCCATGCAG 
5 AATGGTGTTC GTGTGGCTCC 
TGATAAGGGA GGACTGGGTG 
TGAATAAGGT CAATATATCC 
TTTTATCAGG TTGCCCATGG 
TTCAAATGGT GCCTTGGTAT 
10 GCTATGTGGC AGATGGTCAG 
GCCCCATCTG ATAATCCCCA 
TACCAATCTA ACAAATGGTG 



CTTCGTGCGA CCTTTGGCGA 
CCTACCAGAT GAATCTACTG 
ATTACATCAC CAATTCCTTT 
TTGGCTCAGT ATGTAGCAAC 
TCGTATTGTT GAAGGCATTT 
ACTTGATTCA GCAACTGCAA 
GACTCCGATA TGAGCATCTT 
TACTAGTGGA TTGACAACTG 
CCATTAGCGG AAAAACAGGT 
CAAGCAACCA ATACCAATGC 
AATCGCTGTC GCAGTGGTCT 
TAGGACCTTC CATTGCGCGT 



ATATGGCTTG 850 

GATTTGTTCC 900 

GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGTAATAA 1050 

CCGACAGAGA 1100 

GCACCAAGGT 1150 

GACGTGCCTT 1200 

ACAGCCGAAA 1250 

GGTGGCCTAT 1300 

TTCCTCATAA 1350 

GACATT 13 96 



15 



2) INFORMATION FOR- SEQ ID NO: 1022 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1428 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 

AAAGGAGATG CCTGGCATTA GTATTTCTAC TTCTTGGGAT CGAAAGGTTT 50 

TGGAAACTTC CCTTTCTTCT ATAGTAGGGA GTGTATCCAG TGAAAAAGCT 100 

GGTCTCCCAG CGGAAGAAGC AGAATCCTAT CTTAAAAAAG GCTATTCTCT 150 

35 AAATGACCGT GTTGGAACCT CCTATTTGGA AAAGCAATAT GAAGAGACCT 200 

TACAAGGAAA ACGCTCGGTA AAAGAAATCC ATCTGGATAA ATATGGCAAT 2 50 

ATGGAAAGCG TGGACACAAT TGAGGAAGGT AGTAAGGGAA ACAATATCAA 300 

ACTGACCATT GATTTGGCCT TCCAAGATAG CGTGGATGCT TTGCTGAAAA 350 

GTTATTTCAA TTCCGAGCTA GGAAATGGTG GAGCTAAATA TTCTGAAGGT 4 00 

40 GTCTATGCAG TCGCCCTTAA CCCAAAAACA GGTGCTGTTT TATCCATGTC 450 

AGGGATCAAA CATGACCTGA AAACGGGAGA GTTGACTCCT GATTCCTTGG 500 

GAACGGTAAC CAATGTCTTT GTCCCAGGTT CGGTTGTCAA GGCTGCGACC 550 

ATCAGCTCAG GTTGGGAAAA TGGTGTTTTA TCAGGAAACC AAACCTTAAC 600 

AGATCAGCCT ATTGTTTTCC AAGGTTCAGC TCCAATTTAT TCTTGGTATA 650 

4 5 AATTGGCATA TGGATCTTTT CCTATTACAG CTGTGGAAGC CTTGGAGTAT 700 

TCATCCAATG CTTACATGGT TCAAACCGCT CTTGGAATCA TGGGCCAGAC 750 

CTATCAACCA AATATGTTTG TTGGAACCAG CAATTTGGAA ACAGCTATGG 800 

GAAAACTTCG TGCGACCTTT GGCGAATATG GCTTGGGGGC TGCGACCGGA 850 

ATTGACCTAC CAGATGAATC TACTGGATTT GTTCCCAAAG AGTATAGCTT 900 

50 TGCTAATTAC ATCACCAATT CCTTTGGGCA GTTTGATAAC TATACGCCCA 950 

TGCAGTTGGC TCAGTATGTA GCAACTATTG CAAATAATGG TGTTCGTGTG 1000 

GCTCCTCGTA TTGTTGAAGG CATTTATGGT AATAATGATA AGGGAGGACT 1050 

GGGTGACTTG ATTCAGCAAC TGCAACCGAC AGAGATGAAT AAGGTCAATA 1100 

TATCCGACTC CGATATGAGC ATCTTGCACC AAGGTTTTTA TCAGGTTGCC 1150 

55 CATGGTACTA GTGGATTGAC AACTGGACGT GCCTTTTCAA ATGGTGCCTT 1200 

GGTATCCATT AGCGGAAAAA CAGGTACAGC CGAAAGCTAT GTGGCAGATG 1250 

GTCAGCAAGC AACCAATACC AATGCGGTGG CCTATGCCCC ATCTGATAAT 13 00 

CCCCAAATCG CTGTCGCAGT GGTCTTTCCT CATAATACCA ATCTAACAAA 13 50 

TGGTGTAGGA CCTTCCATTG CGCGTGACAT TATCAATCTG TATCAAAAAT 1400 

60 ACCATCCAAT GAACTAGAAA GGAAATTA 1428 
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2) INFORMATION FOR SEQ ID NO: 1023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

•(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 

ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGTATCC 100 

AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 

AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 200 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 250 

AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 300 

AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 

CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAGCTAAA 4 00 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 4 50 

TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 500 

CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 550 

AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 

CCAAACCTTA ACAGATCAGC CTATTGTTTT CCAAGGTTCA GCTCCAATTT 650 

ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCCTATTAC AGCTGTGGAA 700 

GCCTTGGAGT ATTCATCCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 750 

CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 800 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 850 

GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 900 

AGAGTATAGC TTTGCTAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

ACTATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 1000 

GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTGACT TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG ACAACTGGAC GTGCCTTTTC 12 00 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 1300 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 1350 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 14 00 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



2) INFORMATION FOR SEQ ID NO: 1024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCCTATTT GGAAAAGCAA 2 00 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

15 GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCTAA 400 

ATATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCTG 4 50 

TTTTATCCAT GTCAGGGATC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT, AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CAGGTTGGGA AAATGGTGTT TTATCAGGAA 600 

20 ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

25 GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGAGTATAG CTTTGCTAAT TACATCACCA ATTCCTTTGG GCAGTTTGAT 950 

AACTATACAC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATAA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

30 AATAAGGTCA ATATATCCGA CTCCGATATG AGCATCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCCCATGGTA CTAGTGGATT GACAACTGGA CGTGCCTTTT 1200 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACAGGTAC AGCCGAAAGC 1250 

TATGTGGCAG ATGGTCAGCA AGCAACCAAT ACCAATGCGG TGGCCTATGC 1300 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

35 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AATGAACTAG AAAGGAAATT ATGCT 1445 



40 2) INFORMATION FOR SEQ ID NO: 1025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 200 

60 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 
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TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 350 

GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCCAA 4 00 

GTATTCTGAG GGTGTGTATG CAGTCGCCCT TAACCCCAAA ACAGGTGCTG 450 

5 TTTTGTCTAT GTCAGGACTC AAACATGACC TGAAAACGGG AGAGTTGACT 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CTGGCTGGGA AAATGGTGTT TTATCAGGAA 600 

ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 700 

10 AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAG CTTTGCTAAT TACATCACCA ATGCCTTTGG GCAGTTTGAT 950 

15 AACTATACGC CCATGCAGTT GGCTCAGTAT, GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTATCTTGC ACCAAGGATT 1150 

TTACCAAGTA TCGCATGGAA CTAGTCCCCT TACGACAGGA CGGGCGTTTT 12 00 

20 CAGATGGCGC CACTGTTTCT ATCAGTGGTA AGACCGGTAC AGGTGAAAGC 12 50 

TATGTAGCTG GTGGTCAAGA AGCTAATAAT ACCAATGCCG TGGCCTATGC 1300 

TCCAACAGAA AATCCTCAAA TTGCAGTTGC AGTAGTCTTT CCTCATAATA 1350 

CCAATTTAAC CAAAAATGTT GGGCCAGCAA TTGCTCGCGA CATTATCAAT 14 00 

TTATATAACC AACACCATCC AATGAATTAG AAAGGAAGCC A 1441 

25 



2) INFORMATION FOR SEQ ID NO: 1026 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 300 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

5 TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

10 GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTA 1443 



15 



2) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1443 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

35 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 350 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 4 00 

4 0 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 

CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 

GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 

AAATCAGACC TTGACAGACC AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

4 5 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 700 

GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 

TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 

TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 

GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAGCGGA TTGACAACAG GTCGTGCCTT 1200 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 12 50 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 1300 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 1400 

60 ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAT TTA 1443 
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2) INFORMATION FOR SEQ ID NO: 1028 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028 



20 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 200 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

25 GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 500 

30 CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 1200 

TTCAAATGGC GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

4 5 GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 1300 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGC 144 6 

50 



2) INFORMATION FOR SEQ ID NO: 1029 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

575 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029 



TTGCCTCTAT 
GATAGAAAGG 

10 CAGTGAAAAA 
AAGGCTATTC 
TATGAAGAGA 
TAAATATGGC 
GAAACAATAT 

15 GCTTTACTGA 
GTATTCTGAA 
TTTTGTCTAT 
CCTGATTCCT 
CAAGGCGGCG 

20 ACCAGACCTT 
AATTCTTGGT 
AGCTCTGGAG 
TTATGGGGCA 
GAGTCTGCTA 

25 GGCTGCGACT 
AAGACTATAA 
AACTATACCC 
TGGTGTTCGT 
ATAAGGGAGG 

30 AATAAGGTCA 
TTATCAGGTT 
CAAATGGTGC 
TATGTGGCAG 
CCCATCTGAT 

35 CCAATCTAAC 
CTGTATCAAA 



TTCAAAGGAG 
TTTTGGAAAC 
GCTGGTCTCC 
TCTAAATGAC 
CCTTACAAGG 
AATATGGAAA 
CAAACTGACC 
AAAGTTATTT 
GGTGTCTATG 
GTCAGGGATT 
TGGGAACGGT, 
ACCATCAGCT 
GACAGACCAG 
ATACTCAGGC 
TATTCATCCA 
GACCTATCAA 
TGGGTAAATT 
GGGATTGATC 
CTTTGCCAAT 
CAATGCAATT 
GTGGCTCCTC 
ACTGGGTGAC 
ATATATCCGA 
GCTCATGGGA 
CTTGGTATCC 
ATGGTCAGGA 
AATCCCCAAA 
AAATGGTGTA 
AATACCATCC 



ATGCCTGGCA 
TTCCCTTTCT 
CAGCGGAAGA 
CGTGTAGGAA 
AAAACGCTCG 
GCGTGGATAC 
ATTGATTTGG 
CAATTCTGAG 
CAGTCGCCCT 
AAACATGACT 
AACCAATGTC 
CAGGTTGGGA 
TCCATTGTCT 
TTACGGTTCA 
ATGCTTATAT 
CCCAATATGT 
GCGTTCAACC 
TACCAGATGA 
TTCATTACCA 
GGCTCAGTAT 
GTATTGTTGA 
TTGATTCAGC 
CTCCGATATG 
CTAGTGGGTT 
ATTAGCGGAA 
AGCAACCAAT 
TCGCTGTCGC 
GGACCTTCCA 
AAT 



TTAGTATTTC 
TCTATAGTTG 
AGCAGAAGCC 
CCTCCTATTT 
GTAAAAGAAA 
AATTGAGGAA 
CCTTCCAAGA 
CTAGAAAATG 
TAACCCAAAA 
TGAAAACGGG 
TTTGTTCCAG 
AAATGGAGTC 
TCCAAGGTTC 
TTCCCTATCA 
GGTCCAAACA 
TTGTCGGCAC 
TTTGGCGAAT 
ATCTACTGGA 
ATGCCTTTGG 
GTAGCAACTA 
AGGCATTTAT 
AACTGCAACC 
AGTGTCTTGC 
GACAACTGGC 
AAACGGGTAC 
ACCAATGCGG 
AGTGGTCTTT 
TTGCGCGTGA 



TACTTCTTGG 50 

GGAGTGTATC 100 

TATCTTAAAA 150 

GGAAAAGCAA 200 

TCCATCTGGA 250 

GGTAGTAAGG 300 

TAGCGTGGAT 350 

GTGGAGCCAA 400 

ACAGGTGCGG 450 

AGAGTTGACG 500 

GTTCGGTTGT 550 

TTGTCAGGAA 600 

AGCTCCCATC 650 

CAGCGGTCCA 700 

GCCTTAGGTC 750 

CAGCAATCTA 800 

ATGGCTTGGG 850 

TTTGTTCCCA 900 

GCAGTTTGAT 950 

TTGCAAATGA 1000 

GGTAATAATG 1050 

GACAGAGATG 1100 

ACCAAGGTTT 1150 

CGTGCCTTTT 1200 

AGCCGAAAGC 1250 

TGGCCTATGC 1300 

CCTCATAATA 1350 

CATTATCAAT 14 00 
1423 



40 2) INFORMATION FOR SEQ ID NO: 1030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 7 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1030 



TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

60 AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

576 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
GATAAATATG GCAATATGGA 
GGGAAACAAT ATCAAACTGA 
ATGCTTTGCT GAAAAGTTAT 
AAATATTCTG AAGGTGTCTA 
5 TGTTTTATCC ATGTCAGGGA 
CTCCTGATTC CTTGGGAACG 
GTCAAGGCTG CGACCATCAG 
AAACCAAACC TTAACAGATC 
TTTATTCTTG GTATAAATTG 

10 GAAGCCTTGG AGTATTCATC 
AATCATGGGC CAGACCTATC 
TGGAAACAGC TATGGGAAAA 
GGGGCTGCGA CCGGAATTGA 
CAAAGAGTAT AGCTTTGCTA 

15 ATAACTATAC GCCCATGCAG 
AATGGTGTTC GTGTGGCTCC 
TGATAAGGGA GGACTGGGTG 
TGAATAAGGT CAATATATCC 
TTTTATCAGG TTGCCCATGG 

20 TTCAAATGGT GCCTTGGTAT 
GCTATGTGGC AGATGGTCAG 
GCCCCATCTG ATAATCCCCA 
TACCAATCTA ACAAATGGTG 
ATCTGTATCA AAAATACCAT 

25 



AAGCGTGGAC ACAATTGAGG 
CCATTGATTT GGCCTTCCAA 
TTCAATTCCG AGCTAGGAAA 
TGCAGTCGCC CTTAACCCAA 
TCAAACATGA CCTGAAAACG 
GTAACCAATG TCTTTGTCCC 
CTCAGGTTGG GAAAATGGTG 
AGCCTATTGT TTTCCAAGGT 
GCATATGGAT CTTTTCCTAT 
CAATGCTTAC ATGGTTCAAA 
AACCAAATAT GTTTGTTGGA 
CTTCGTGCGA CCTTTGGCGA 
CCTACCAGAT GAATCTACTG 
ATTACATCAC CAATTCCTTT 
TTGGCTCAGT, ATGTAGCAAC 

tcgtattgtt' GAAGGCATTT 
ACTTGATTCA GCAACTGCAA 
GACTCCGATA TGAGCATCTT 
TACTAGTGGA TTGACAACTG 
CCATTAGCGG AAAAACAGGT 
CAAGCAACCA ATACCAATGC 
AATCGCTGTC GCAGTGGTCT 
TAGGACCTTC CATTGCGCGT 
CCAATGAACT AGAAAGGAAA 



PCT/CA00/01150 

AAGGTAGTAA 300 

GATAGCGTGG 350 

TGGTGGAGCT 4 00 

AAACAGGTGC 4 50 

GGAGAGTTGA 500 

AGGTTCGGTT 550 

TTTTATCAGG 600 

TCAGCTCCAA 650 

TACAGCTGTG 700 

CCGCTCTTGG 750 

ACCAGCAATT 800 

ATATGGCTTG 850 

GATTTGTTCC 900 

GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGTAATAA 1050 

CCGACAGAGA 1100 

GCACCAAGGT 1150 

GACGTGCCTT 1200 

ACAGCCGAAA 1250 

GGTGGCCTAT 13 00 

TTCCTCATAA 13 50 

GACATTATCA 1400 

TTATGCT 14 4 7 



2) INFORMATION FOR SEQ ID NO: 1031 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 6 bases 

(B) , TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 



(xi) SEQUENCE DESCRIPTION: 



ATTGCCTCTA 

45 GGATAGAAAG 

CCAGTGAAAA 

AAAGGCTATT 

ATACGAGGAA 

ATAAATATGG 

50 GGAAACAATA 

TGCTTTGCTG 

AGTATTCTGA 
GTTTTGTCTA 

GCCTGATTCC 
55 TTAAGGCCGC 
AACCAAACCT 
TTATTCTTGG 
AAGCCTTGGA 
ATCATGGGCC 
60 GGAAACAGCT 



TTTCAAAGGA 
GTTTTGGAAA 
AGCTGGTCTC 
CTCTAAATGA 
ACCTTACAAG 
CAATATGGAA 
TCAAACTGAC 
AAAAGTTATT 
AGGCGTGTAT 
TGTCAGGACT 
TTGGGAACGG 
TACCATCAGC 
TAACAGATCA 
TATAAATTGG 
GTATTCATCT 
AGACCTATCA 
ATGGGAAAAC 



GATGCCTGGC 
CTTCTCTTTC 
CCAGCGGAAG 
CCGTGTTGGG 
GAAAACGCTC 
AGCGTGGATA 
CATTGATTTG 
TCAATTCCGA 
GCAGTCGCCC 
CAAACATGAC 
TAACCAATGT 
TCAGGTTGGG 
GCCTATTGTT 
CATATGGATC 
AATGCTTACA 
ACCAAATATG 
TTCGTGCGAC 

577 



ID NO: 1031 




ATTAGTATTT 


CTACCTCTTG 


50 


TTCTATAGTA 


GGGAGTGTAT 


100 


AAGCAGAAGC 


CTATCTTAAA 


150 


ACTTCTTACC 


TGGAAAAACA 


200 


GGTAAAAGAA 


ATCCATCTGG 


250 


CAATTGAGGA 


AGGTAGTAAG 


300 


GCCTTCCAAG 


ATAGCGTGGA 


350 


GCTAGGAAAT 


GGTGGAGCTA 


400 


TTAACCCCAA 


AACAGGTGCT 


450 


CTGAAAACGG 


GAGACTTGAC 


500 


CTTTGTCCCA 


GGGTCAGTAG 


550 


AAAATGGTGT 


TTTATCAGGA 


600 


TTCCAAGGTT 


CAGCTCCAAT 


650 


TTTTCCTATT 


ACAGCTGTGG 


700 


TGGTTCAAAC 


CGCTCTTGGA 


750 


TTTGTTGGAA 


CCAGCAATTT 


800 


CTTTGGCGAA 


TATGGCTTGG 


850 
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GGGCTGCGAC CGGAATTGAC 
AAAGAGTATA GCTTTGCTAA 
TAACTATACG CCGATGCAGT 
ATGGTGTTCG TGTGGCTCCT 
5 GATAAGGGAG GACTGGGTGA 
GAATAAGGTC AATATATCCG 
TTTATCAGGT TGCCCATGGT 
TCAAATGGCG CCTTGGTATC 
CTATGTGGCA GATGGTCAGC 
10 CCCCATCTGA TAATCCCCAA 
ACCAATCTAA CAAATGGTGT 
TCTGTATCAA AAATACCATC 



CTACCAGATG AATCTACTGG 
TTACATTACT AATGCCTTTG 
TGGCTCAGTA TGTAGCAACT 
CGTATTGTTG AAGGCATTTA 
CTTGATTCAG CAACTGCAAC 
ACTCCGATAT GAGCATCTTG 
ACTAGTGGAT TGACAACTGG 
CATTAGCGGA AAAACAGGTA 
AAGCAACCAA TACCAATGCG 
ATCGCTGTTG CAGTGGTCTT 
AGGACCTTCC ATTGCGCGTG 
CAATGAACTA GAAAGGAAAT 



PCT/CA00/01150 

ATTTGTTCCC 900 

GGCAGTTTGA 950 

ATTGCAAATG 1000 

TGGTAATAAT 1050 

CGACAGAGAT 1100 

CACCAAGGTT 1150 

ACGTGCCTTT 1200 

CAGCCGAAAG 1250 

GTGGCCTATG 1300 

TCCTCATAAT 1350 

ACATTATCAA 14 00 

TATGCT 1446 



15 



2) INFORMATION FOR SEQ ID NO: 1032 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1446 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 

GGATAGAAAG GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

35 AAAGGCTATT CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 200 

ATATGAAGAG ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGTGTGGATA CAATTGAGGA AGGTAGTAAG 300 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 4 00 

4 0 AGTATTCTGA GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 

TCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGTTCGGTTG 550 

TCAAGGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 

AACCAGACCT TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

45 CAATTCTTGG TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 700 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 

CTTATGGGGC AGACCTACCA ACCTAATATG TTTGTCGGCA CCAGCAACCT 800 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 

GTTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

50 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 1200 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 1250 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 1300 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 13 50 

ACCAACCTTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 14 00 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 144 6 

578 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAGCTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCC 400 

AAGTATTCTG AGGGTGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCTCATGG GACTAGTGGA TTGACAACTG GACGTGCTTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

GTTATGTTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 1300 

GCCCCATCAG ATAATCCTCA AATCGCTGTT GCCGTTGTCT TCCCTCATAA 1350 

CACCAATCTA ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 1400 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



2) INFORMATION FOR SEQ ID NO: 1034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

579 



SUBSTITUTE SHEET (RULE 26) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 

GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 50 

GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 100 

10 CATGAAAATG AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 150 

GAGTTCCTTG AACAGTATTC TTGCAGGGAC AGACGGTATT ATTACCTATG 200 

AAAAAGACCG TGTAGGAAAT ATCGTACCAG GTACAGAACT GGTATCGCAA 250 

CAAACTGTGG ATGGCAAGGA TGTTTATACA ACATTGTCTA GTCCGCTACA 300 

ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 350 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA AGACCGGTGA AATCCTCGCT 4 00 

ACCACCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 450 

GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 500 

CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 550 

CCAAGTGGAG AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 600 

20 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 650 

CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 700 

GAACAAAAAA TGGGAGATGC TACTTGGTTG GATTATCTAA AACGCTTTAA 750 

ATTTGGGGTT CCAACTCGCT TTGGCTTGAC AGATGAATAC GCTGGTCAAC 800 

TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGGCAAGGA 850 

25 . ATTTCAGTGA CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 900 

TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 950 

ACAATCAGTC TGTACGTAAG TCACAAAAAG AAATAGTAGG AAATCCTGTT 1000 

TCCAAAGAGG CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 1050 

GGACCCTCTA TATGGAACTA TGTATAATCA CTACACAGGA AAGCCAATTA 1100 

30 TAACAGTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 1150 

GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 1200 

CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTGTATG 1250 

TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 1300 

GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 1350 

35 TCAATCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 1400 

CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA 14 50 

CGCCGAAATA TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 1500 

AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 1550 

TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 1600 

4 0 AAAGAGACTG CCGAGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 1650 

TGAAGGTTCA GGTTCCGTTG 1670 



45 2) INFORMATION FOR SEQ ID NO: 1035 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035 

580 



SUBSTITUTE SHEET (RULE 26) 
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AAAGAATTGG 
CAATCGTAGT 
CTCAGCTCCA 
GGAATGGAGA 
5 TACCTATGAA 
TTTCCCAACA 
CCCCTCCAGT 
AAAAGGTAAG 
TCCTCGCTAC 

10 ATCACTGAGG 
ACCAGGATCA 
ATACCTTCCC 
GATGCGACGA 
GATGATGACT 

15 GTCTACTTGA 
CGCTTTAAAT 
TGGTCAACTT 
GGCAAGGAAT 
ATTGCTAATG 

20 TGATACTAAC 
ATCCTGTTTC 
GTTGGGACGG 
GCCAATTATA 
CTCAAATCGC 

25 TATATTTTCT 
CTTGTATGTA 
GAGAATTTGC 
CTCAATCTTC 
TTCTTATGCA 

30 AAGCCTTACG 
AAGATTAAAG 
ACAAGTTCTC 
GCTGGAAAAA 
CTGGAATTTG 

35 



AAACTGCAGA 
TACCCAAATG 
TGAAAATGAA 
GTTCCTTGAA 
AAGGATCGTC 
AACGGTGGAT 
CCTTCATGGA 
TATATGACCG 
CACCCAACGA 
ACTTTGTTTG 
GCCATGAAGG 
AAGTGGAGAA 
CTCGAGATTG 
TTCTTACAAG 
ACAAAAAATG 
TTGGGGTTCC 
CCAGCTGATA 
TTCAGTGACA 
ATGGAGTTAT 
AATCAGTCTG 
CAAAGAGGCA 
ACCCTCTATA 
ACAGTTCCTG 
TGATGAGAAA 
CAGTTGTGAC 
ACGGTTCAAC 
CACCCCAATC 
AATCTCCAGC 
ATGCCTAGCA 
CCGAAATATT 
AGACTTCTGT 
CTTTTATCGG 
AGAGACTGCC 
AAGGTTCAGG 



GGTCAAGGGG 

GACAATTTGC 

GATGGAAGCA 

CAGTATTCTT 

TGGGCAATAT 

GGCAAGGATG 

AACTCAGATG 

CGACCTTGGT 

CCTACCTTTA 

GCGTGATATT 

TTATGACGTT 

TACTTCAATA 

GGATGTTAAT 

GTTTCGCTCA 

GGAGATGCTA, 

AACTCGCTTT 

ATATTGTTAG 

CAAACACAAA 

GCTGGAGCCA 

TACGTAAGTC 

GCAAGCACAA 

TGGAACTATG 

GACAAAATGT 

AATGGAGGAT 

TATGAATCCT 

AGCCTGAGCA 

TTGGAGCGGG 

CAAAAATTTA 

TCAAGGATAT 

GTGCAACCAA 

AGAAGAAGGG 

ATAAGGTAGA 

GAGACCTTTG 

TTCCGTTGTT 



ATTGATTTTA 
TTCTAGTTTT 
AGAGTTTGCT 
GCAGGGACAG 
TGTACCCGGA 
TTTACACAAC 
GATGCCTTTC 
CAGTGCAAAG 
ATGCAGATAC 
CTTTATCAAA 
AGCTTCTTCT 
GCAGTGAATT 
GATGGTTTGA 
CTCCAGTAAT 
CTTGGTTGGA 
GGCTTGACAG 
TATTGCTCAA 
TGCTTCGTGC 
AAATTTATAA 
ACAAAAAGAA 
CTCGAAATCA 
TATAATCACT 
AGCAGTTAAA 
ACTTGGTTGG 
GCTGAAAATC 
TTATTCAGGT 
CTTCAGCTAT 
GATAAAGTTA 
TTCACCTGGT 
TCGTTGTAGG 
ACCAATCTTG 
AGAAATTCCA 
CTAAATGGTT 
CAG 



PCT/CAOO/01150 

CAACCAGTCC 50 

ATCGGTCTAG 100 

GGGAACCTCT 150 

ACGGCATTAT 200 

ACAGAACAAG 250 

CATTTCCAGC 300 

TAGAAAAAGT 350 

ACCGGTGAAA 400 

TAAAGAAGGA 450 

GTAACTATGA 500 

ATTGATAATA 550 

CAAAATAGCG 600 

CTACTGGTGG 650 

GTTGGAATGA 700 

TTATCTAAAA 750 

ATGAATACGC 800 

AGCTCATTTG 850 

CTTTACAGCT 900 

GTGCTATTTA 950 

ATAGTAGGAA 1000 

CATGATCTTA 1050 

ACACAGGAAA 110 0 

TCCGGTACGG 1150 

TTCTACCAAT 12 00 

CTGATTTTAT 12 50 

ATCCAGTTGG 1300 

GAAAGAATCT 1350 

CGACAGAATC 14 00 

GAGTTGGCGG 1450 

TACTGGAACA 1500 

CACCAAACCA 1550 

GACATGTATG 1600 

GGATATTGAA 1650 
1683 



2) INFORMATION FOR SEQ ID NO: 1036 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036 

TCAAAAAAGA ATTGGAAACT GCAGAGGTCA AGGGGATTGA TTTTACAACC 50 

55' AGTCCCAATC GTAGTTACCC AAATGGACAA TTTGCTTCTA GTTTTATCGG 100 

TCTAGCTCAG CTCCATGAAA ATGAAGATGG AAGCAAGAGT TTGCTGGGAA 150 

CCTCTGGAAT GGAGAGTTCC TTGAACAGTA TTCTTGCAGG GACAGACGGC 200 

ATTATTACCT ATGAAAAGGA TCGTCTGGGC AATATTGTAC CCGGAACAGA 250 

ACAAGTTTCC CAACAAACGG TGGATGGCAA GGATGTTTAC ACAACCATTT 300 

60 CCAGCCCCCT CCAGTCCTTC ATGGAAACTC AGATGGATGC CTTTCTAGAA 350 

581 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AAAGTAAAAG GTAAGTATAT GACCGCGACC 
TGAAATCCTC GCTACCACCC AACGACCTAC 
AAGGAATCAC TGAGGACTTT GTTTGGCGTG 
TATGAACCAG GATCAGCCAT GAAGGTTATG 
5 TAATAATACC TTCCCAAGTG GAGAATACTT 
TAGCGGATGC GACGACTCGA GATTGGGATG 
GGTGGGATGA TGACTTTCTT ACAAGGTTTC 
AATGAGTCTA CTTGAACAAA AAATGGGAGA 
TAAAACGCTT TAAATTTGGG GTTCCAACTC 

10 TACGCTGGTC AACTTCCAGC TGATAATATT 
ATTTGGGCAA GGAATTTCAG TGACACAAAC 
CAGCTATTGC TAATGATGGA GTTATGCTGG 
ATTTATGATA CTAACAATCA GTCTGTACGT 
AGGAAATCCT GTTTCCAAAG AGGCAGCAAG 

15 TCTTAGTTGG GACGGACCCT CTATATGGAA 
GGAAAGCCAA TTATAACAGT TCCTGGACAA 
TACGGCTCAA ATCGCTGATG AGAAAAATGG 
CCAATTATAT TTTCTCAGTT t GTGACTATGA 
TTTATCTTGT ATGTAACGGT TCAACAGCCT 

20 GTTGGGAGAA TTTGCCACCC CAATCTTGGA 
AATCTCTCAA TCTTCAATCT CCAGCCAAAA 
GAATCTTCTT ATGCAATGCC TAGCATCAAG 
GGCGGAAGCC TTACGCCGAA ATATTGTGCA 
GAACAAAGAT TAAAGAGACT TCTGTAGAAG 

25 AACCAACAAG TTCTCCTTTT ATCGGATAAG 
GTATGGCTGG AAAAAAGAGA CTGCCGAGAC 
TTGAACTGGA ATTTGAAGGT TCAGGTTCCG 



PCT/CA00/01150 

TTGGTCAGTG CAAAGACCGG 400 

CTTTAATGCA GATACTAAAG 4 50 

ATATTCTTTA TCAAAGTAAC 500 

ACGTTAGCTT CTTCTATTGA 550 

CAATAGCAGT GAATTCAAAA 600 

TTAATGATGG TTTGACTACT 650 

GCTCACTCCA GTAATGTTGG 700 

TGCTACTTGG TTGGATTATC 750 

GCTTTGGCTT GACAGATGAA 800 

GTTAGTATTG CTCAAAGCTC 850 

ACAAATGCTT CGTGCCTTTA 900 

AGCCAAAATT TATAAGTGCT 950 

AAGTCACAAA AAGAAATAGT 1000 

CACAACTCGA AATCACATGA 1050 

CTATGTATAA TCACTACACA 1100 

AATGTAGCAG TTAAATCCGG 1150 

AGGATACTTG GTTGGTTCTA 1200 

ATCCTGCTGA AAATCCTGAT 1250 

GAGCATTATT CAGGTATCCA 1300 

GCGGGCTTCA GCTATGAAAG 1350 

ATTTAGATAA AGTTACGACA 1400 

GATATTTCAC CTGGTGAGTT 1450 

ACCAATCGTT GTAGGTACTG 1500 

AAGGGACCAA TCTTGCACCA 1550 

GTAGAAGAAA TTCCAGACAT 16 00 

CTTTGCTAAA TGGTTGGATA 1650 

TT 1682 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1037 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1681 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037 

CAAAAAAGAA TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAATCG TAGTTATCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TACTGGGAAC 150 

50 CTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGTA 200 

TTATTACCTA TGAAAAAGAC CGTGTAGGAA ATATCGTACC AGGTACAGAA 250 

CTGGTATCGC AACAAACTGT GGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

55 GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

60 GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

582 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAO0/01 150 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

5 AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

10 ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 1300 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

15 AATCTTCTTA TGCAATGCCT AGCATCAAGG , ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

20 TGAACTGGAA TTTGAAGGTT CAGGTTCCGT T 1681 



2) INFORMATION FOR SEQ ID NO: 1038 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038 

40 AAAAAAGAAT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTATCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT ACTGGGAACC 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGTAT 200 

TATTACCTAT GAAAAAGACC GTGTAGGAAA TATCGTACCA GGTACAGAAC 250 

45 TGGTATCGCA ACAAACTGTG GATGGCAAGG ATGTTTATAC AACATTGTCT 300 

AGTCCGCTAC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

50 TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

55 AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

60 GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

583 
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TTAGTTGGGA CGGACCCTCT 
AAAGCCAATT ATAACAGTTC 
CGGCTCAAAT CGCTGATGAG 
AATTATATTT TCTCAGTTGT 
TATCTTGTAT GTAACGGTTC 
TGGGAGAATT TGCCACCCCA 
TCTCTCAATC TTCAATCTCC 
ATCTTCTTAT GCAATGCCTA 
CGGAAGCCTT ACGCCGAAAT 
ACAAAGATTA AAGAGACTTC 
CCAACAAGTT CTCCTTTTAT 
ATGGCTGGAA AAAAGAGACT 
GAACTGGAAT TTGAAGGTTC 



ATATGGAACT ATGTATAATC 
CTGGACAAAA TGTAGCAGTT 
AAAAATGGAG GATACTTGGT 
GACTATGAAT CCTGCTGAAA 
AACAGCCTGA GCATTATTCA 
ATCTTGGAGC GGGCTTCAGC 
AGCCAAAAAT TTAGATAAAG 
GCATCAAGGA TATTTCACCT 
ATTGTGCAAC CAATCGTTGT 
TGTAGAAGAA GGGACCAATC 
CGGATAAGGT AGAAGAAATT 
GCCGAGACCT TTGCTAAATG 
AGGTTCCGTT GTTCA 



PCT/CAOO/01150 

ACTACACAGG 1100 

AAATCCGGTA 1150 

TGGTTCTACC 1200 

ATCCTGATTT 1250 

GGTATCCAGT 1300 

TATGAAAGAA 1350 

TTACGACAGA 1400 

GGTGAGTTGG 1450 

AGGTACTGGA 1500 

TTGCACCAAA 1550 

CCAGACATGT 1600 

GTTGGATATT 1650 
1685 



2) INFORMATION FOR SEQ ID NO: 1039 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1679 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039 

AAAAGAATTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGACTA 100 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTAC TGGGAACCTC 150 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGTATTA 200 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 3 50 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 4 00 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGATGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 
GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA , 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 14 00 

584 
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CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
ACTGGAATTT GAAGGTTCAG GTTCCGTTG 



PCT/CA00/01150 

TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2) INFORMATION FOR SEQ ID NO: 1040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

15 (C) STRANDEDNESS : Double, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE; 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 0 



TAAAAAGGAT CTAAAAGACG CTAGTGTTGA AGGAATTGAC TTCACAACTA 50 

GCCCTAATAG AAGCTATCCA AATGGACAAT TCGCTTCTAG TTTTATTGGT 100 

TTGGCCCAAC TCCATGAAAA TGAGGATGGT AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGGATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 2 00 

30 TTATTACCTA TGAAAAAGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

35 AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGAAGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

40 ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

45 TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

50 CAATTATATT TTCTCAGCTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 14 50 

55 GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT CGTTC 1685 



60 
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PCT/CA00/01150 



10 



55 



2) INFORMATION FOR SEQ ID NO: 1041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 

GTCTATCAAA AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA 50 

20 CAACCAGTCC CAATCGTAGT TATCCAAACG GACAATTTGC TTCTAGTTTT 100 

ATCGGACTAG CTCAGCTCCA TGAAAATGAA GATGGCAGCA AGAGCTTACT 150 

GGGAACCTCT GGAATGGAGA GTTCCTTGAA CAGTATTCTT GCAGGGACAG 200 

ACGGTATTAT TACCTATGAA AAAGACCGTG TAGGAAATAT CGTACCAGGT 250 

ACAGAACTGG TATCGCAACA AACTGTGGAT GGCAAGGATG TTTATACAAC 300 

25 ATTGTCTAGT CCGCTACAAT CTTTCATGGA AACTCAGATG GATGCCTTTC 350 

TAGAAAAAGT AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG 4 00 

ACCGGTGAAA TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC 450 

TAAAGAAGGA ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA 500 

GTAACTATGA ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT 550 

30 ATTGATAATA ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT 600 

CAAAATAGCG GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA 650 

CTACTGGTGG GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT 700 

GTTGGAATGA GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA 750 

TTATCTAAAA CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG 800 

35 ATGAATACGC TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA 850 

AGCTCATTTG GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC 900 

CTTTACAGCT ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA 950 

GTGCTATTTA TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA 1000 

ATAGTAGGAA ATCCTGTTTC .CAAAGAGGCA GCAAGCACAA CTCGAAATCA 1050 

40 CATGATCTTA GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT 1100 

ACACAGGAAA GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA 1150 

TCCGGTACGG CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG 12 00 

TTCTACCAAT TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC 1250 

CTGATTTTAT CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT 1300 

45 ATCCAGTTGG GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT 1350 

GAAAGAATCT CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA 14 00 

CGACAGAATC TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT 1450 

GAGTTGGCGG AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG 1500 

TACTGGAACA AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG 1550 

50 CACCAAACCA ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA 16 00 

GACATGTATG GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT 1650 

GGATATTGAA CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAGAAG 1696 



2) INFORMATION FOR SEQ ID NO: 1042 



. (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1689 bases 
60 (B) TYPE: Nucleic acid 

586 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT GCTGGGAACC 150 

15 TCTGGAATGG AGAGTTCCTT GAACAGTATT , CTTGCAGGGA CAGACGGCAT 200 

TATTACCTAT GAAAAGGATC GTCTGGGAAA TATTGTCCCC GGAACGGAAC 250 

AAGTTTCCCA ACAAACTGTA GATGGCAAGG ATGTTTATAC GACTATTTCC 300 

AGCACCCTTC AGTCCTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 350 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

20 AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 6 50 

25 TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

30 GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

35 CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 12 00 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 1250 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 1300 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAC 1350 

TCTCTCAATC TTCAAACAAC AGCTAAAGCT TTGGAGCAAG TAAGTCAACA 1400 

4 0 AAGTCCTTAT CCTATGCCTA GTGTCAAGGA TATTTCACCT GGTGATTTAG 1450 

CAGAAGAATT GCGTCGCAAT CTTGTACAAC CCATCGTTGT GGGAACAGGA 1500 

ACGAAGATTA AAAACAGTTC TGCTGAAGAA GGGAAGAATC TTGCCCCGAA 1550 

TCAGCAAGTC CTTATCTTAT CTGATAAAGT AGAGGAAGTT CCAGATATGT 1600 

ATGGTTGGAC AAAGGAGACT GCTGAGACCC TTGCTAAGTG GCTCAATATA 1650 

4 5 GAACTTGAAT TTCAAGGTTC GGGTTCTACT GTGCAGAAG 1689 



50 



2) INFORMATION FOR SEQ ID NO: 1043 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus pneumoniae 

587 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(B) STRAIN: StrR-10 



PCT/CA00/0J150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043 

CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 

CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 200 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 1300 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 14 50 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1690 



2) INFORMATION FOR SEQ ID NO: 1044 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 

AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTCCTAAT 50 
CGTAGTTACC CAAACGGACA ATTTGCTTCT AGTTTTATCG GACTAGCTCA 100 
GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 
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TGGAGAGTTC TTTAAATAGA ATTCTTGCAG 
TATGAAAAGG ATCGTCTGGG AAATATTGTC 
CCAACAAACG GTAGATGGCA AGGATGTTTA 
TTCAGTCCTT CATGGAGACC CAGATGAATG 
5 GGCAAGTATA TGACGGCTAC CTTGGTCAGT 
TGCAACAACG CAACGGCCGA CCTTCGATGC 
CCAAGGACTT TGTTTGGCGT GATATCCTCT 
GGGTCAACCA TGAAGGTCAT GACGCTTGCT 
TTTCCCAGGA GGAGAAGTTT TCAATAGTAG 

10 CGACAATTCG AGATTGGGAT GTTAATGATG 
ATGACTTTCT TACAAGGTTT CGCTCACTCC 
ACTTGAACAA AAAATGGGAG ATGCTACTTG 
TTAAGTTTGG GGTGCCGACG CGTTTTGGTC 
CAATTGCCTG CAGATAATAT TGTTAATATT 

15 GGGGATTTCA GTGACCCAGA CGCAAATGAT 
CTAATGATGG TGTTATGCTG GAGCCTAAAT 
CCAAATGATC AAACTGCTCG GAAATCTCAA 
TGTTTCTAAA GATGCAGCTA GTCTAACTCG 
GGACGGATCC GGTTTATGGA ACCATGTATA 

20 ACTGTAACTG TTCCTGGGCA AAATGTAGCC 
GATTGCTGAC GAGAAAAATG GTGGTTATCT 
TTTTCTCGGC TGTATCGATG AATCCGGCTG 
TATGTAACGG TACAGCAACC TGAACATTAT 
ATTTGCCACC CCAATCTTGG AGCGGGCTTC 

25 ATCTTCAATC TCCAGCCAAG AATTTAGATA 
TATGCAATGC CTAGCATCAA GGATATTTCA 
CTTACGCCGA AATATTGTGC AACCAATCGT 
TTAAAGAGAC TTCTGTAGAA GAAGGGACCA 
GTTCTCCTTT TATCGGATAA GGTAGAAGAA 

30 GAAAAAAGAG ACTGCTGAAA CCTTTGCTAA 
AATTTGAAGG GTCAGGTT 



PCT/CAOO/01150 

GGACAGACGG CATTATTACC 200 

CCCGGAACGG AACAAGTTTC 250 

TACGACTATT TCCAGCACCC 300 

CCTTTCAAGA AAAAGTAAAA 350 

GCTAAAACAG GGGAAATTCT 400 

TGATACTAAG GAAGGACTTA 450 

ATCAAAGTAA CTATGAGCCA 500 

GCTGCTATTG ATAATAACAC 550 

TGAATTAAAA ATAGCGGATG 600 

GTTTGACGAC TGGTGGGATG 650 

AGTAATGTTG GAATGAGTCT 700 

GTTGGATTAT CTAAACCGCT 750 

TGACTGATGA GTATTCAGGT 800 

GCCATGAGTG CATTTGGTCA 850 

TCGTGCCTTT ACAGCTATTG 900 

TTATTAGTGC CATTTATGAT 950 

AAAGAAATTG TGGGAAATCC 1000 

GACTAACATG ATTTTGGTAG 1050 

ACCACAGCAC AGGTAAGCCA 1100 

CTCAAGTCTG GTACGGCTCA 1150 

AGTCGGGTTA ACCAACTATA 1200 

AAAATCCTGA TTTTATCTTG 1250 

TCAGGTATCC AGTTGGGAGA 1300 

AGCTATGAAA GAATCTCTCA 1350 

AAGTTACGAC AGAATCTTCT 14 00 

CCTGGTGAGT TGGCGGAAGC 1450 

TGTAGGTACT GGAACAAAGA 1500 

ATCTTGCACC AAACCAACAA 1550 

ATTCCAGACA TGTATGGCTG 1600 

ATGGTTGGAT ATTGAGCTGG 1650 

1668 



35 2) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 5 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGCTTCTAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 
GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA TTATTACCTA - 200 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 250 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 300 

CAGTCCTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA AAGTAAAAGG 350 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 400 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 450 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 500 
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ATCAGCCATG 
TCCCAAGTGG 
ACGACTCGAG 
GACTTTCTTA 
5 TTGAACAAAA 
AAATTTGGGG 
ACTTCCAGCT 
GAATTTCAGT 
AATGATGGAG 

10 TAACAATCAG 
TTTCCAAAGA 
ACGGACCCTC 
TATAACAGTT 
TCGCTGATGA 

15 TTCTCAGTTG 
TGTAACGGTT 
TTGCCACCCC 
CTTCAATCTC 
TGCAATGCCT 

20 TACGCCGAAA 
AAAGAGACTT 
TCTCCTTTTA 
AAAAAGAGAC 
TTTGAAGGTT 

25 



AAGGTTATGA 
AGAATACTTC 
ATTGGGATGT 
CAAGGTTTCG 
AATGGGAGAT 
TTCCAACTCG 
GATAATATTG 
GACACAAACA 
TTATGCTGGA 
TCTGTACGTA 
GGCAGCAAGC 
TATATGGAAC 
CCTGGACAAA 
GAAAAATGGA 
TGACTATGAA 
CAACAGCCTG 
AATCTTGGAG 
CAGCCAAAAA 
AGCATCAAGG 
TATTGTGCAA 
CTGTAGAAGA 
TCGGATAAGG 
TGCCGAGACC 
CAGGTTCCGT 



CGT.TAGCTTC 
AATAGCAGTG 
TAATGATGGT 
CTCACTCCAG 
GCTACTTGGT 
CTTTGGCTTG 
TTAGTATTGC 
CAAATGCTTC 
GCCAAAATTT 
AGTCACAAAA 
ACAACTCGAA 
TATGTATAAT 
ATGTAGCAGT 
GGATACTTGG 
TCCTGCTGAA 
AGCATTATTC 
CGGGCTTCAG 
TTTAGATAAA 
ATATTTCACC 
CCAATCGTTG 
AGGGACCAAT 
TAGAAGAAAT 
TTTGCTAAAT 
TGTTCAGAAG 



TTCTATTGAT 
AATTCAAAAT 
TTGACTACTG 
TAATGTTGGA 
TGGATTATCT 
ACAGATGAAT 
TCAAAGCTCA 
GTGCCTTTAC 
ATAAGTGCTA 
AGAAATAGTA 
ATCACATGAT 
CACTACACAG 
TAAATCCGGT 
TTGGTTCTAC 
AATCCTGATT 
AGGTATCCAG 
CTATGAAAGA 
GTTACGACAG 
TGGTGAGTTG 
TAGGTACTGG 
CTTGCACCAA 
TCCAGACATG 
GGTTGGATAT 



PCT/CAOO/01150 

AATAATACCT 550 

AGCGGATGCG 600 

GTGGGATGAT 650 

ATGAGTCTAC 700 

AAAACGCTTT 750 

ACGCTGGTCA 800 

TTTGGGCAAG 850 

AGCTATTGCT 900 

TTTATGATAC 950 

GGAAATCCTG 1000 

CTTAGTTGGG 1050 

GAAAGCCAAT 1100 

GCGGCTCAAA 1150 

CAATTATATT 1200 

TTATCTTGTA 1250 

TTGGGAGAAT 1300 

ATCTCTCAAT 1350 

AATCTTCTTA 1400 

GCGGAAGCCT 1450 

AACAAAGATT 1500 

ACCAACAAGT 1550 

TATGGCTGGA 1600 

TGAACTGGAA 1650 
1680 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1046 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: S] 



AAAAAAGAGT 

45 TCCTAATCGT 
TAGCTCAACT 
TCTGGAATGG 
TATTACTTAT 
AAGTTTCCCA 

50 AGTCCGCTCC 
AGTAAAAGGT 
AAATCCTCGC 
GGAATCACTG 
TGAACCAGGA 

55 ATAATACCTT 
GCGGATGCGA 
TGGGATGATG 
TGAGTCTACT 
AAACGCTTTA 

60 CGCTGGTCAA 



TGGAAACTGC 
AGTTACCCAA 
CCATGAAAAT 
AGAGTTCCTT 
GAAAAGGATC 
ACAAACGGTA 
AATCTTTCAT 
AAGTATATGA 
TACCACCCAA 
AGGACTTTGT 
TCAGCCATGA 
CCCAAGTGGA 
CGACTCGAGA 
ACTTTCTTAC 
TGAACAAAAA 
AATTTGGGGT 
CTTCCAGCTG 



AGAGGTCAAG 
ACGGACAATT 
GAAGATGGAA 
GAACAGTATT 
GTCTGGGTAA 
GATGGCAAGG 
GGAAACTCAG 
CCGCGACCTT 
CGACCTACCT 
TTGGCGTGAT 
AGGTTATGAC 
GAATACTTCA 
TTGGGATGTT 
AAGGTTTCGC 
ATGGGAGATG 
TCCAACTCGC 
ATAATATTGT 

590 



ID NO: 1046 




GGGATTGATT 


TTACAACCAG 


50 


TGCTTCTAGT 


TTTATCGGTC 


100 


GCAAGAGTTT 


GCTGGGAACT 


150 


CTTGCAGGGA 


AAGACGGTAT 


200 


TATTGTCCCT 


GGAACAGAAC 


250 


ATGTTTATAC 


AACATTGTCT 


300 


ATGGATGCCT 


TTCTAGAAAA 


350 


GGTCAGTGCA 


AAGACCGGTG 


400 


TTAATGCAGA 


TACTAAAGAA 


450 


ATTCTTTATC 


AAAGTAACTA 


500 


GTTAGCTTCT 


TCTATTGATA 


550 


ATAGCAGTGA 


ATTCAAAATA 


600 


AATGATGGTT 


TGACTACTGG 


650 


TCACTCCAGT 


AATGTTGGAA 


700 


CTACTTGGTT 


GGATTATCTA 


750 


TTTGGCTTGA 


CAGATGAATA 


800 


TAGTATTGCT 


CAAAGCTCAT 


850 
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PCT/CA00/01150 



TTGGGCAAGG AATTTCAGTG 
GCTATTGCTA ATGATGGAGT 
TTATGATACT AACAATCAGT 
GAAATCCTGT TTCCAAAGAG 
5 TTAGTTGGGA CGGACCCTCT 
AAAGCCAATT ATAACAGTTC 
CGGCTCAAAT CGCTGATGAG 
AATTATATTT TCTCAGTTGT 
TATCTTGTAT GTAACGGTTC 

10 TGGGAGAATT TGCCACCCCA 
TCTCTCAATC TTCAATCTCC 
ATCTTCTTAT GCAATGCCTA 
CGGAAGCCTT ACGCCGAAAT 
ACAAAGATTA AAGAGACTTC 

15 CCAACAAGTT CTCCTTTTAT 
ATGGCTGGAA AAAAGAGACT 
GAACTGGAAT TTGAAGGTTC 



ACACAAACAC AAATGCTTCG 
TATGCTGGAG CCAAAATTTA 
CTGTACGTAA GTCACAAAAA 
GCAGCAAGCA CAACTCGAAA 
ATATGGAACT ATGTATAATC 
CTGGACAAAA TGTAGCAGTT 
AAAAATGGAG GATACTTGGT 
GACTATGAAT CCTGCTGAAA 
AACAGCCTGA GCATTATTCA 
ATCTTGGAGC GGGCTTCAGC 
AGCCAAAAAT TTAGATAAAG 
GCATCAAGGA TATTTCACCT 
ATTGTGCAAC CAATCGTTGT 
TGTAGAAGAA GGGACCAATC 
CGGATAAGGT , AGAAGAAATT 
GCCGAGACCT TTGCTAAATG 
AGGTTCCGTT GTTCAGAAG 



TGCCTTTACA 900 

TAAGTGCTAT 950 

GAAATAGTAG 1000 

TCACATGATC 1050 

ACTACACAGG 1100 

AAATCCGGTA 1150 

TGGTTCTACC 1200 

ATCCTGATTT 1250 

GGTATCCAGT 13 00 

TATGAAAGAA 1350 

TTACGACAGA 1400 

GGTGAGTTGG 1450 

AGGTACTGGA 1500 

TTGCACCAAA 1550 

CCAGACATGT 1600 

GTTGGATATT 1650 
1689 



20 



2) INFORMATION FOR SEQ ID NO: 1047 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 

CAAAAAAGAA TTGGAAGCTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAACCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TGCTGGGAAC 150 

4 0 TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA 2 00 

TTATTACCTA TGAAAAGGAT CGTCTGGGTA ATATTGTACC CGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTACA CAACCATTTC 3 00 

CAGCCCCCTC CAGTCCTTTA TGGAAACCCA GATGGATGCT TTTCAAGAGA 350 

AGGTAAAAGG AAAGTACACG ACAGCGACTT TGGTCAGTGC TAAAACAGGG 4 00 

4 5 GAAATTCTGG CAACAACGCA ACGACCGACC TTTGATGCAG ATACAAAAGA 4 50 

AGGCATTACA GAGGACTTTG TTTGGCGTGA TATCCTTTAC CAAAGTAACT 500 

ATGAGCCAGG TTCCCCTATG AAAGTGATGA TGTTGGCTGC TGCTATTGAT 550 

AATAATACCT TTCCAGGGGG AGAAGTCTTC AATAGTAGTG AGTTAAAAAT 600 

TGCAGATGTC ACGATTCGAG ATTGGGACGT CAATGAAGGA TTGACTGGTG 650 

50 GCAGAATGAT GACCTTTTCT CAAGGGTTCG CTCACTCAAG TAACGTTGGG 700 

ATGACGCTTC TTGAGCAAAA GATGGGAGAT GCTACATGGC TTGATTATCT 750 

AAATCGCTTT AAATTTGGTG TTCCGACTCG TTTTGGCTTG ACGGATGAAT 800 

ATGCAGGCCA ACTTCCAGCT GACAATATCG TAAATATTGC TCAGAGTTCA 850 

TTTGGACAAG GGATTTCAGC GACCCAGACG CAAATGATTC GTGCCTTCAC 900 

55 GGCTATTGCC AACGATGGAG TCATGTTAGA ACCTAAATTT ATCAGTGCCA 950 

TTTATGATCC AAATGATCAA ACTGCTCGGA AATCACAAAA AGAAGTTGTG 1000 

GGAAATCCTG TGTCTAAAGA TGCAGCGAGC TTGACGCGAA CGCATATGGT 1050 

TTTAGTCGGT ACCGATCCAG TATATGGAAC TATGTATAAT CATAAGACAG 1100 

GGAAACCAAC TGTAACTGTT CCTGGGCAAA ATGTAGCCCT CAAGTCTGGT 1150 

60 ACGGCTCAGA TTGCCGATGA GAAAAATGGA GGTTACTTAG TTGGTACGAC 1200 

591 
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CAATTACATT TTTTCGGCTG 
TTATTCTCTA TGTGACGGTT 
TTGGGGGAAT TTGCCAATCC 
TTCCCTTAAC CTCCAATCTA 
5 AAAGCGCTTA TGCCATGCCT 
GCGGAAGCCT TACGTCGCAA 
AACAAAAATT AAAGAATCAT 
ACCAGCAAGT TCTTCTCTTA 
TATGGTTGGA CAAAAGAGAC 
10 AGAACTTGAA TTTGAAGGTT 



TATCGATGAA CCCTGCTGAA 
CAACAGCCTG AGCATTATTC 
TATCTTGGAA AGGGCAGTGG 
CCGCTAAAAC GTTAAATCAG 
AGCATCAAGG ACATTTCACC 
TATTGTGCAA CCAATCGTTG 
CTGTAGAAGA AGGGACGGAT 
TCTGATAAAG CAGAGGAAGT 
TGCTGAGACC TTTGCTAAGT 
CGGGCTCTAC TGTGCAGAAG 



PCT/CAOO/01150 

AATCCTGATT 1250 

AGGTATTCAG 1300 

CTATGAAAGA 1350 

GTAACCAATC 14 00 

TGGCGATTTG 14 50 

TAGGAACAGG 1500 

CTTGCACCTA 1550 

TCCAGATATG 1600 

GGCTCAATAT 1650 
1690 



2) INFORMATION FOR SEQ ID NO:. 1048 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 8 

30 AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTACCCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 150 

TGGTTTGGAG AGTTCTTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 2 00 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 2 50 

35 GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 3 00 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 3 50 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 4 00 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 4 50 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

4 0 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 700 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 14 00 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 14 50 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1500 

60 AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1550 
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AACAAGTTCT CCTTTTATCG GATAAGGTAG 
GGCTGGAAAA AAGAGACTGC TGAAACCTTT 
GTTGGAATTT GAAGGTTCAG GTTCCGTCGT 
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AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 
TC 1682 



2) INFORMATION FOR SEQ ID NO: 1049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049 

GCAGCGTTGC GTGATACCGT TGAAAAAACC ATTAAAAACT GTTTGGATTT 50 

TGAAAGGAGA CAGGAGCATG AATAGAATAA AAGTTGCAAT ACTGTTTGGG 100 

GGTTGCTCAG AGGAGCATGA CGTATCGGTA AAATCTGCAA TAGAGATAGC 150 

CGCTAACATT AATAAAGAAA AATACGAGCC GTTATACATT GGAATTACGA 200 

AATCTGGTGT ATGGAAAATG TGCGAAAAAC CTTGCGCGGA ATGGGAAAAC 250 

GACAATTGCT ATTCAGCTGT ACTCTCGCCG GATAAAAAAA TGCACGGATT 300 

ACTTGTTAAA AAGAACCATG AATATGAAAT CAACCATGTT GATGTAGCAT 350 

TTTCAGCTTT GCATGGCAAG TCAGGTGAAG ATGGATCCAT ACAAGGTCTG 4 00 

TTTGAATTGT CCGGTATCCC TTTTGTAGGC TGCGATATTC AAAGCTCAGC 450 

AATTTGTATG GACAAATCGT TGACATACAT CGTTGCGAAA AATGCTGGGA 500 

TAGCTACTCC CGCCTTTTGG GTTATTAATA AAGATGATAG GCCGGTGGCA 550 

GCTACGTTTA CCTATCCTGT TTTTGTTAAG CCGGCGCGTT CAGGCTCATC 600 

CTTCGGTGTG AAAAAAGTCA ATAGCGCGGA CGAATTGGAC TACGCAATTG 650 

AATCGGCAAG ACAATATGAC AGCAAAATCT TAATTGAGCA GGCTGTTTCG 700 

GGCTGTGAGG TCGGTTGTGC GGTATTGGGA AACAGTGCCG CGTTAGCTGT 750 

TGGCGAGGTG GACCAAATCA GGCTGCAGTA CGGAATCTTT CGTATTCATC 800 

AGGAAGTCGA GCCGGAAAAA GGCTCTGAAA ACGCAGTTAT AACCGTTCCC 850 

GCAGACCTTT CAGCAGAGGA GCGAGGACGG ATACAGGAAA CGGCAAAAAA 900 

AATATATAAA GCGCTCGGCT GTAGAGGTCT AGCCCGTGTG GATATGTTTT 950 

TACAAGATAA CGGCCGCATT GTACTGAACG AAGTCAATAC TCTGCCCGGT 1000 

TTCACGTCAT ACAGTCGTTA TCCCCGTATG ATGGCCGCTG CAGGTATTGC 1050 

ACTTCCCGAA CTGATTGACC GCTTGATCGT ATTAGCGTTA AAGGGGTGAT 1100 

AAGCATGGAA ATAGGATTTA CTTTTTTAGA TGAAATAGTA CACGGTGTTC 1150 

GTTGGGACGC TAAATATGCC ACTTGGGATA ATTTCACCGG AAAACCGGTT 1200 

GACGGTTATG AAGTAAATCG CATTGTAGGG ACATACGAAT T 1241 



2) INFORMATION FOR SEQ ID NO: 1050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCG 124 9 



2) INFORMATION FOR SEQ ID NO: 1051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1272 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051 

TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 100 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 150 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 200 

GAATTACGAA ATCTGGTGTA .TGGAAAATGT GCGAAAAACC TTGCGCGGAA 2 50 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 3 00 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 350 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 4 00 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 450 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 500 
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ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 
CCGGTGGCAG CTACGTTTAC CTATCCTGTT 
AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 
ACGCAATTGA ATCGGCAAGA CAATATGACA 
5. GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 
GTTAGTTGTT GGCGAGGTGG ACCAAATCAG 
GTATTCATCA GGAAGTCGAG CCGGAAAAAG 
ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 
GGCAAAAAAA ATATATAAAG CGCTCGGCTG 

10 ATATGTTTTT ACAAGATAAC GGCCGCATTG 
CTGCCCGGTT TCACGTCATA CAGTCGTTAT 
AGGTATTGCA CTTCCCGAAC TGATTGACCG 
AGGGGTGATA AGCATGGAAA TAGGATTTAC 
ACGGTGTTCG TTGGGACGCT AAATATGCCA 

15 AAACCGGTTG ACGGTTATGA AGTAAATCGC , 
GGCTTGAATC GCTTTTTGAA GG 
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TTATTAATAA AGATGATAGG 550 

TTTGTTAAGC CGGCGCGTTC 600 

TAGCGCGGAC GAATTGGACT 650 

GCAAAATCTT AATTGAGCAG 700 

GTATTGGGAA ACAGTGCCGC 750 

GCTGCAGTAC GGAATCTTTC 800 

GCTCTGAAAA CGCAGTTATA 850 

CGAGGACGGA TACAGGAAAC 900 

TAGAGGTCTA GCCCGTGTGG 950 

TACTGAACGA AGTCAATACT 1000 

CCCCGTATGA TGGCCGCTGC 1050 

CTTGATCGTA TTAGCGTTAA 1100 

TTTTTTAGAT GAAATAGTAC 1150 

CTTGGGATAA TTTCACCGGA 1200 

ATTGTAGGGA CATACGAATT 1250 

1272 



20 2) INFORMATION FOR SEQ ID NO: 1052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 



TCCCCCGGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 2 00 

40 ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 2 50 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 3 50 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 4 00 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 4 50 

45 TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

50 GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

55 TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

60 CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTA 1237 
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2) INFORMATION FOR SEQ ID NO: 1053 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053 

20 CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 100 

AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 200 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 250 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 300 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 350 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 4 00 

. GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 450 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

30 TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 550 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 

GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 650 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 700 

TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 750 

3 5 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 

ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 900 

GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 950 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

4 0 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 1100 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT 1200 

AATTTCACCG GAAAACCGGT TGACGGGTAT GAAAGTAAAT CGCATTGTAG 1250 

45 GGACATTCGA ATT 1263 



50 



2) INFORMATION FOR SEQ ID NO: 1054 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus faecalis 
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(B) STRAIN: R610 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 

CGGCCTATTA TNCCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT 50 

AAAAACTGTT TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG 100 

TTGCAATACT GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA 150 

TCTGCAATAG AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT 200 

ATACATTGGA ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT 250 

GCGCGGAATG GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT 300 

AAAAAAATGC ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA 350 

CCATGTTGAT GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG 4 00 

GATCCATACA AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC 4 50 

GATATTCAAA GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT 500 

TGCGAAAAAT GCTGGGATAG CTACTCCCGC , CTTTTGGGTT ATTAATAAAG 550 

ATGATAGGCC GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG 600 

GCGCGTTCAG GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA 650 

ATTGGACTAC GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA 700 

TTGAGCAGGC TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC 750 

AGTGCCGCGT TAGTTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG 800 

AATCTTTCGT ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG 850 

CAGTTATAAC CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA 900 

CAGGAAACGG CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC 950 

CCGTGTGGAT ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG 1000 

TCAATACTCT GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG 1050 

GCCGCTGCAG GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT 1100 

AGCGTTAAAG GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA 1150 

AATAGTACAC GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT 1200 

TCACCGGAAA ACCGGTTGAC GGTTATAAGT AA 1232 



2) INFORMATION FOR SEQ ID NO: 1055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE; Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 

TACCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT AAAAACTGTT 50 

TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG TTGCAATACT 100 

GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA TCTGCAATAG 150 

AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT ATACATTGGA 200 

ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT GCGCGGAATG 250 

GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT AAAAAAATGC 300 

ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA CCATGTTGAT 350 

GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG GATCCATACA 400 

AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC GATATTCAAA 450 

GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT TGCGAAAAAT 500 

GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG ATGATAGGCC 550 

GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG GCGCGTTCAG 600 
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GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC 650 

GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA TTGAGCAGGC 700 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 7 50 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 850 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCAATACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 1100 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 1150 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 1200 

ACCGGTTGAC GGTTAGAA 1218 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 1056 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056 



AATCACACCG CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

35 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 200 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 250 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 3 00 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 350 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 450 

CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 500 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 700 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 7 50 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 

AGCGAGGACG GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 1000 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 1100 

55 CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 
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2) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCGC 1250 

ATTGTAGGGA CATACGAAT 1269 



2) INFORMATION FOR SEQ ID NO: 1058 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R420 

599 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058 



PCT/CA00/01150 



CAAATTTTCT 
AAAGGAAAGG 
5 ATTCTCCAGA 
GCTATTAACC 
AATGGATTGG 
CTTGGCTAGA 
GGATTTATAT 

10 CTTGCATGGG 
TAATGAACCT 
ATGAACAAAT 
TGCTCCCACT 
ATCGTTTTAT 

15 GCCGGTTCTT 
ATCTGCATTA 
AGGCGATAGC 
TTGACGATTG 
TTTTGAAGAG 

20 CATTGCCTCT 
TATCGAAACT 
CAATCAAGGA 
CTGGGCACTC 
GAAATATTAG 

25 AACACATTAC 
GCCCCCGCAC 



TTTCTTTTCC 
AAGAAAATGA 
ATACTCAGTG 
CGCTGAAATA 
TATTGGTATC 
AGATCACAAA 
TAGGAGAAAA 
AAGTATGGCG 
GCCTTATGTT 
GGCTCTTGCA 
TTGCTTTTAT 
TCAAGACCAT 
CAAAAGGGAT 
ACGACTGCTT 
GGGTATTGAA 
GTGCTTGTGA 
AAATACCAAT 
CGCGCTTGAA 
TGGGATTGAC 
GCGATTTATT 
CCGCTACCCA 
TAGAGCAATT 
AATTGATCAA 
TTAGTGCTA 



TAGGTACACT 
TGAAAAAAAT 
TCACTAACCT 
TGAAGTAATG 
AAGGAAACCT 
AACTGTCACC 
ACGAATCGTC 
AGGATGGCTG 
GGTTGCCATG 
TCAACTTGCT 
CCCGCTATGA 
GGATTCCCGA 
CACAAAAGTA 
TTGCTTACGG 
ATTGGCTGCG 
TGCGATTTCT 
TAATCAGCGC 
TCACAGATCA 
GGGTCTGGCT- 
TAAACGAAAT 
GCTATGATGG 
GATTGCACTG 
TAAAAACCAT 



GAATGTAACC 
TGCCGTTTTA 
CAGCAGCAAG 
ACCATTGGCA 
CGCGAATGTT 
AGCTGACTTT 
CCTGATGTCC 
TATCCAAGGA 
TCGCTGCCTC 
GATACCATGG 
AAACGATCCT 
TCTTTATCAA 
ACTGACAAAA 
TTCTACTGTG 
GCATCTTAGG 
CTTGTCGACG 
CACGATCACT 
AGGAGCAGGC 
CGAATCGATT 
CAACACCATG 
CGGAAGTCGG 
GCAGAGGAGG 
CCATTGAAAA 



TTAAAAGAAA 
TTTGGAGGGA 
TGTGATCCAA 
TCGCACCAAC 
CGCAATGATA 
TTCTAGCCAA 
TCTTTCCAGT 
CTGCTTGAAC 
CGCATTATGT 
GAATCGCTAG 
GCCACAATCG 
GCCGAATGAA 
CAGCGCTCCA 
TTGATCCAAA 
AAATGAGCAA 
GTTTTTTTGA 
GTCCCAGCAC 
ACAGCTGCTT 
TTTTCGTCAC 
CCGGGATTTA 
GTTATCCTAC 
ACAAACGATG 
AAAATCAAGA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1169 



30 2) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 300 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

600 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TTGACGATTG GTGCTTGTGA 
TTTTGAAGAG AAATACCAAT 
CATTGCCTCT CGCGCTTGAA 
TATCGAAACT TGGGATTGAC 
5 CAATCAAGGA GCGATTTATT 
CTGGGCACTC CCGCTACCCA 
GAAATATTAG TAGAGCAATT 
AACACATTAC AATTGATCAA 
GCCCCCGCAC TTAGTG 



TGCGATTTCT CTTGTCGACG 
TAATCAGCGC CACGATCACT 
TCACAGATCA AGGAGCAGGC 
GGGTCTGGCT CGAATCGATT 
TAAACGAAAT CAACACCATG 
GCTATGATGG CGGAAGTCGG 
GATTGCACTG GCAGAGGAGG 
TAAAAACCAT CCATTGAAAA 



PCT/CA00/01150 

GTTTTTTTGA 800 

GTCCCAGCAC 850 

ACAGCTGCTT 900 

TTTTCGTCAC 950 

CCGGGATTTA 1000 

GTTATCCTAC 1050 

ACAAACGATG 1100 

AAAATCAAGA 1150 
1166 



2) INFORMATION FOR SEQ ID NO: 1060 

15 (i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 1028 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 
25 (B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC 50 

30 GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTACTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 300 

35 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AGCTGCCTTA TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT 4 00 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 4 50 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 550 

4 0 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 8 00 

45 GCGCCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTGGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 

50 



2) INFORMATION FOR SEQ ID NO: 1061 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

601 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
5 (B) STRAIN: R689 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061 



AATATGAAAA 

10 GTTTCTTTAG 
CGATGACTAT 
GGTATTTGTA 
TTGGATACGA 
CTTTTGGCTA 

15 CCATTATGCA 
GAATTGATGA 
ATGTATGAAC 
AAAGTGCTCC 
ATCGAAGCTT 

20 TGAAGCGGGC 
TCGCTTCTGC 
CAAAAAAATA 
CTCTTTGACT 
TCGATTTTGA 

25 GCACCATTGC 
GCTCTATCGT 
TCACGGATCA 
TTTACGAGTC 
CTATCAAGAA 

30 



AAATCGCCTT 
CCTCAGCAAC 
GACCTCTCTT 
TACAGGAGAA 
AACATAAACA 
AGTGAAGAGC 
TGGCAAATAC 
AACTACCTTA 
AAATGGCTGC 
TACGATTCTC 
TTATCCAGAC 
TCCTCAAAAG 
CTTAAAAGAA 
TTGCCGGTGT 
GTCGGTGCTT 
AGAAAAGTAC 
CTGAAACGAT 
AGTCTTTGTC 
AGGAGAACTA 
ACTCCCGCTA 
CTACTACAAA 



ATTTTTGGAG 
TAGCGCAATC 
TGATCGGGAT 
CTGGAAAACA 
GAAAATCCAG 
AACAAACGTT 
GGGGAAGATG 
TGTAGGCTGC 
TGCATCAAGC 
TTGACAAATC 
CCATGGCTTT 
GGATCACTAA 
GCCTTTACTT 
TGAGATCGGT 
GTGACGCTAT 
CAGCTGATCA 
TGAAACCAAG 
TTAAAGGTCT 
TACTTGAATG 
TCCTGCCATG 
AACTGCTTGT 



GCAATTCACC 
GAAGCACTCC 
CGCCCCAGAT 
TCCGACAAGA 
CCGCTTTTTG 
GGTTCCTGAT 
GCAGTATCCA 
GGGGTGGCAG 
AGCAGAAGCG 
AAGCCAACCA 
CCGGTTTTTT 
AGTCACCTGC 
ATTGTTCAGC 
TGCGGTATTT 
TTCATTAGTA 
GCGCCAAGAT 
GTCAAAGAAC 
TGCTCGCATC 
AAATCAATAC 
ATGGCAGCGA 



GG AGTACGCC 50 

AATCATCTCC 100 

GCTATGGATT 150 

CACGTGGTTG 2 00 

AAGGAAACGG 250 

GTTTTATTTC 300 

AGGATTGTTT 350 

CCTCTGCCTT 4 00 

ATTGGCGTAC 450 

GCAAGATCAA 500 

TTAAGCCTAA 550 

GTTGAAGAAA 600 

AGTGCTCCTA 650 

TGGGCAACGA 700 

GACGGCTTTT 750 

CACCGTTCCT 800 

AAGCTCAGCT 850 

GACTTTTTTG 900 

TATGCCGGGC 950 

TCGGCTTATC 10 00 
1030 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 



AAACATGAAA 
50 CCGTTTCTTT 
CCCTATGACT 
TTGGTACTTG 
TGTTGGATAC 
GGCTTTTGGC 
55 TCCCATTATG 
TTGAATTGAT 
TTATGTATGA 
ACAAAGTGCT 
AAATCGAAGC 
60 AATGAAGCGG 



AAAATCGCCA 
AGCTTCAGCA 
ACGACCTCTC 
TATACAGGAG 
GAAACATAAA 
TAAGTGAAGA 
CATGGCAAAT 
GAAGCTGCCT 
ACAAATGGCT 
CCTACGATTC 
TTTTATCCAG 
GCTCCTCAAA 



TTATTTTTGG 
ACTAGCGCAA 
TTTGATCGGG 
AACTGGAAAA 
CAGAAAATAC 
GCAGCAAACG 
ACGGGGAAGA 
TATGTAGGCT 
GCTGCATCAA 
TCTTGACAAA 
ACCCATGGCT 
AGGGATCACT 



AGGCAATTCA 
TCGAAGCACT 
ATCGCCCCAG 
CATCCGACAA 
AGCCGCTATT 
TTGGTACCTG 
TGGCAGTATC 
GCGGGGTGGC 
GCTGCAGCAG 
TCAAGCCAAC 
TCCCAGTTTT 
AAAGTCACCT 



CCGGAATACA 50 

CCAATCATCT 100 

ATGCTATGGA 150 

GACACGTGGT 200 

CGAAGGAAAC 250 

ATGTTTTATT 300 

CAAGGATTGT 350 

AGGTTCTGCC 4 00 

CCATTGGCGT 4 50 

CAGCAAGAAC 500 

CTTTAAGCCT 550 

GCGTTGAAGA 600 



602 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



AATCGCTTCT GCCTTAAAAG 

TACAAAAAAA TATTGCCGGT 

GACTCTTTGA CTGTCGGTGC 

TTTCGATTTT GAAGAAAAGT 

5 CTGCGCCATT GCCTGAAACG 

CTGCTCTATC GTAGTCTTGG 

TGTCACGGAG CGAGGAGAAC 

GCTTTACGAG TCACTCCCGC 

TCCTATCAAG AACTACTACA 



AAGCCTTTAC TTATTGTTCC 
GTTGAGATCG GTTGCGGTAT 
TTGTGACGCC ATTTCATTAG 
ACCAGCTGAT CAGCGCCAAA 
ATTGAAACCA AGGTCAAAGA 
TCTTAAAGGT CTTGCTCGCA 
TATACTTGAA TGAAATCAAT 
TATCCTGCCA TGATGGCAGC 
AAAACTGCTT G 



GCAGTGCTCC 650 

TTTGGGCAAC 700 

TAGACGGCTT 750 

ATCACCGTCC 800 

ACAAGCTCAG 850 

TCGACTTTTT 900 

ACTATGCCGG 950 

GGTCGGCTTA 1000 
1031 



2) INFORMATION FOR SEQ ID NO: 1063 

15 (i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

25 (B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

30 CGTTTCTTTA GCTTCAGCAA CTAGCGCAAT CGAAGCACTC CAATCATCTC 100 

CCTATGACTA CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

TGGTACTTGT ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 200 

GTTGGATACG AAACATAAAC AGAAAATACA GCCGCTATTT GAAGGAAACG 250 

GCTTTTGGCT AAGTGAAGAG CAGCAAACGT TAGTACCTGA TATTTTATTT 300 

35 CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 350 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 4 00 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 450 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 550 

4 0 ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGCCGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

ACTCTTTGAC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 800 

45 TGCGCCATTG CCTGAAACGA TTGAAACCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT ATACTTGAAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 1030 

50 



2) INFORMATION FOR SEQ ID NO: 1064 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

603 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/0J150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

L0 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 200 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

L5 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 400 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC, TCTTGACAAA TCACGCCAAC CAGCAAGAAC 50 0 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 550 

20 AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 650 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 700 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 750 

TTTCGATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 800 

25 CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 850 

CTGCTCTATC GTAGTCTTGG ACTTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAAC TATACTTGAA TGAAATCAAT ACTATGCCGG 950 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 1000 

TCCTATCAAG AACTACTACA AAAACTACTT GT 1032 

30 



2) INFORMATION FOR SEQ ID NO: 1065 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
45 (B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGGCAATT CACCGGAATA 50 

50 CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATCCGAC AAGACACGTG 200 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 300 

55 TTTCCCATTA TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGCTGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 400 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 4 50 

GTACAAAGTG CTCCTACGAT TCTCTTGACA AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

60 CTAATGAAGC GGGCTCCTCA AAAGGGATCA CTAAAGTCAC CTGCGTTGAA 600 

604 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GAAATCGCTT CTGCCTTAAA 
CCTACAAAAA AATATTGCCG 
ACGACTCTTT GACTGTCGGT 
TTTTTCGATT TTGAAGAAAA 
5 CCCTGCGCCA TTGCCTGAAA 
AGCTGCTCTA TCGTAGTCTT 
TTTGTCACGG ATCAAGGAGA 
GGGCTTTACG AGTCACTCCC 
TATCCTATCA AGAACTACTA 



AGAAGCCTTT ACTTATTGTT 
GTGTTGAGAT CGGTTGCGGT 
GCTTGTGACG CCATTTCATT 
GTACCAGCTG ATCAGCGCCA 
CGATTGAAAC CAAGGTCAAA 
GGTCTTAAAG GTCTTGCTCG 
ACTATACTTG AATGAAATCA 
GCTATCCTGC CATGATGGCA 
CAAAAACTGC TTGT 



PCT/CA00/01150 

CCGCAGTGCT 650 

ATTTTGGGCA 700 

AGTAGACGGC 750 

AAATCACCGT 80 0 

GAACAAGCTC 850 

CATCGACTTT 900 

ATACTATGCC 950 

GCGGTCGGCT 1000 
1034 



2) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066 

CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATACACCG 50 

30 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 100 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 200 

TGGATACGAA ACATACACAG AAAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA GCAAACGTTG GTACCTGATG TTTTATTTCC 300 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 350 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 400 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 450 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 500 

TCGAAGCCTT TATCCAGACC CATGGCTTTC CAGTTTTCTT TAAGCCTAAT 550 

40 GAAGCGGGCT CCTCAAAAGG GATCACAAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 650 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 700 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGCTTTTT 750 

CGATTTTGAA GAAAAGTACC AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAACGATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

CTCTATCACA GTCTTGGTCT TAAAGGACTT GCTCGCATCG ACTTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 950 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 1000 

TATCAAGAAT TA 1012 

50 



2) INFORMATION FOR SEQ ID NO: 1067 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

605 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 

CTTACGCTTT ATCGATTAGA CACGGGNAGC TTGTCCAATG GGRAGCCGAT 50 

10 TTGATTTTAT GGATGAACGC TCTCATCATG CGGCAAATGG AATATCATGC 100 

AATGAAGCGC AAAATCGCAG ACGTTTGCGC TCCATCATGG AAAACAGTGG 150 

GTTTGAAGCA TATAGCCTCG AATGGTGGCA CTATGTATTA AGAGACGAAC 2 00 

CATACCCCAA TAGCTATTTT GATTTCCCCG TTAAATAAAC TTTTAACCGT 250 

TGCACGGACA AACTATATAA GCTAACTCTT TCGGCAGGAA ACCCGACGTA 300 

15 TGTAACTGGT TCTTAGGGAA TTTATATATA GTAGATAGTA TTGAAGATGT 350 

AAGGCAGAGC GATATTGCGG TCATTATCTG CGTGCGCTGC GGCAAGATAG 4 00 

CCTGATAATA AGACTGATCG CATAGAGGGG TGGTATTTCA CACCGCCCAT 450 

TGTCAACAGG CAGTTCAGCC, TCGTTAAATT CAGCATGGGT ATCACTTATG 500 

AAAATTCATC TACATTGGTG ATAATAGTAA ATCCAGTAGG GCGAAATAAT 550 

20 TGACTGTAAT TTACGGGGCA AAACGGCACA ATCTCAAACG AGATTGTGCC 600 

GTTTAAGGGG AAGATTCTAG AAATATTTCA TACTTCCAAC TATATAGTTA 650 

AGGAGGAGAC TGAAAATGAA GAAGTTGTTT TTTTTATTGT TATTGTTATT 700 

CTTAATATAC TTAGGTTATT G 721 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1068 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 



ATTTTTAAGG ATGAACGCTC TTCATCATGC GGCAAATGGA ATATCATGCA 50 

ATGAAGCGCA AAATCGCAGA CGTTTGCGCT CCATCATGGA AAACAGTGGG 100 

45 TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC TATGTATTAA GAGACGAACC 150 

ATACCCCAAT AGCTATTTTG ATTTCCCCGT TAAATAAACT TTTAACCGTT 200 

GCACGGACAA ACTATATAAG CTAACTCTTT CGGCAGGAAA CCCGACGTAT 250 

GTAACTGGTT CTTAGGGAAT TTATATATAG TAGATAGTAT TGAAGATGTA 300 

AGGCAGAGCG ATATTGCGGT CATTATCTGC GTGCGCTGCG GCAAGATAGC 350 

50 CTGATAATAA GACTGATCGC ATAGAGGGGT GGTATTTCAC ACCGCCCATT 4 00 

GTCAACAGGC AGTTCAGCCT CGTTAAATTC AGCATGGGTA TCACTTATGA 450 

AAATTCATCT ACATTGGTGA TAATAGTAAA TCCAGTAGGG CGAAATAATT 500 

GACTGTAATT TACGGGGCAA AACGGCACAA TCTCAAACGA GATTGTGCCG 550 

TTTAAGGGGA AGATTCTAGA AATATTTCAT ACTTCCAACT ATATAGTTAA 600 

55 GGAGGAGACT GAAAATGAAG AAGTTGTTTT TTTTATTGTT ATTGTTATTC 650 

TTAATATACT TAGGTTAT 668 



60 2) INFORMATION FOR SEQ ID NO: 1069 

606 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069 

CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 100 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 200 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 250 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 300 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 350 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 400 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 450 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 500 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 7 00 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA 7 60 



2) INFORMATION FOR SEQ ID NO: 1070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070 

AAAAGGAATA CGGGGCTTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 200 

TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT 250 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA 300 

ATAAACTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 350 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 4 00 

ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

607 
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ATTTCACACC GCCCATTGTC AACAGGCAGT 

ATGGGTATCA CTTATGAAAA TTCATCTACA 

AGTAGGGCGA AATAATTGAC TGTAATTTAC 

CAAACGAGAT TGTGCCGTTT AAGGGGAAGA 

5 TCCAACTATA TAGTTAAGGA GGAGACTGAA 

TTATTGTTAT TGTTATTCTT AATATACTTA 
A 
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TCAGCCTCGT TAAATTCAGC 550 

TTGGTGATAA TAGTAAATCC 600 

GGGGCAAAAC GGCACAATCT 650 

TTCTAGAAAT ATTTCATACT 700 

AATGAAGAAG TTGTTTTTTT 750 

GGTTATGACT ACGTTAATGA 800 

801 



10 



2) INFORMATION FOR SEQ ID NO: 1071 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 711 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE : • Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 100 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

30 ACTATGTATT AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC 200 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 250 

TTCGGCAGGA AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT 300 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 4 00 

3 5 GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 4 50 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

4 0 TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G 711 



45 2) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 

608 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCCATTGATC 
GGGGAGCCGA 
GAATATCATG 
GAAAACAGTG 
5 AAGAGACGAA 
CTTTTAACCG 
AACCCGACGT 
ATTGAAGATG 
CGGCAAGATA 

10 ACACCGCCCA 
TATCACTTAT 
GGCGAAATAA 
GAGATTGTGC 
CTATATAGTT 

15 TTATTGTTAT 
G 
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TTACGCTTTA 
TTTGATTTTA 
CAATGAAGCG 
GGTTTGAAGC 
CCATACCCCA 
TTGCACGGAC 
ATGTAACTGG 
TAAGGCAGAG 
GCCTGATAAT 
TTGTCAACAG 
GAAAATTCAT 
TTGACTGTAA 
CGTTTAAGGG 
AAGGAGGAGA 
TCTTAATATA 



TCGATTAGAC 
TGGATGAACG 
CAAAATCGCA 
ATATAGCCTC 
ATAGCTATTT 
AAACTATATA 
TTCTTAGGGA 
CGATATTGCG 
AAGACTGATC 
GCAGTTCAGC 
CTACATTGGT 
TTTACGGGGC 
GAAGATTCTA 
CTGAAAATGA 
CTTAGGTTAT 



ACGGGTNAGC 
CTCTCATCAT 
GACGTTTGCG 
GAATGGTGGC 
TGATTTCCCC 
AGCTAACTCT 
ATTTATATAT 
GTCATTATCT 
GCATAGAGGG 
CTCGTTAAAT 
GATAATAGTA 
AAAACGGCAC 
GAAATATTTC 
AGAAGTTGTT 
GACTACGTTA 



TTGTACCAAT 
GCGGCAAATG 
CTCCATCATG 
ACTATGTATT 
GTTAAATAAA 
TTCGGCAGGA 
AGTAGATAGT 
GCGTGCGCTG 
GTGGTATTTC 
TCAGCATGGG 
AATCCAGTAG 
AATCTCAAAC 
ATACTTCCAA 
TTTTTTATTG 
ATGAAGCACT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
751 



20 2) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 50 

GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA 150 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC 200 

4 0 GTTGCACGGA CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG 250 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 300 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT 350 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 4 00 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 450 

45 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 500 

ATTGACTGTA ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT 600 

TAAGGAGGAG ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 650 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 



2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

609 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Enterococcus faecium 
5 (B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074 

■ ATCGATTAGA CACGGGTGAG CTTGTACCAA TGGGGAGCCG ATTTGATTTT 50 

10 ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT GCAATGAAGC 100 

GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT GGGTTTGAAG 150 

CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC 200 

AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC GTTGCACGGA 250 

CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG TATGTAACTG 300 

15 GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT GTAAGGCAGA 350 

GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT AGCCTGATAA 4 00 

TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC ATTGTCAACA 450 

GGCAGTTCAG CCTCGTTAAA, TTCAGCATGG GTATCACTTA TGAAAATTCA 500 

TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA ATTGACTGTA 550 

20 ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG CCGTTTAAGG 600 

GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT TAAGGAGGAG 650 

ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA TTCTTAATAT 700 

ACTTAGGTTA TGACTACGTT AATGAAGCAC TG 732 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1075 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075 



TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 100 

45 AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 200 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 250 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

50 CATAGAGGGG TGGTATTTCA CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGCATGGGT ATCACTTATG AAAATTCATC TACATTGGTG 4 50 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TGACTGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTCTAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

55 GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 650 

ACTACGTTAA TGAAGCACTG 670 



60 2) INFORMATION FOR SEQ ID NO: 1076 

610 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 948 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: DG131/3 serotype 0X3 :H8 

(C) ACCESSION NUMBER: Z36901 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1076 



ATGAAAATAA 
TGTTAATGTG 
CGTATGTAGA 
CAGACTATTT 
AGGGGATAAT 
GGCGGTTTAA 
ACAGGATTTG 
TTCACATGTT 
GTAGCTATAC 
CAGATAAATC 
TAGCGGAACC 
TTACTGTGAC 
CGTACAACAC 
AGATGTTGAT 
ACTATCATGG 
GTTAATGCAA 
TGCATCGCGA 
CGGTAGATGG 
TCATCCACTC 



TAATTTTTAG 
GTTGCGAAGG 
TTCGCTGAAT 
CATCAGGAGG 
TTGTTTGCAG 
TAATCTACGG 
TTAACAGGAC 
ACCTTTCCTG 
CACGTTACAG 
GCCATTCGTT 
TCACTGACGC 
AGCTGAAGCT 
TTGATGATCT 
CTTACGTTGA 
ACAAGACTCT 
TTCTGGGTAG 
GTTGCCAGAA 
AAGAGTGCGT 
TGGGGGCAAT 



AGTGCTAACT 
AATTTACCTT 
GTCATTCGCT 
TACGTCTTTA 
TTGATGTCAG 
CTTATTGTTG 
AAATAATGTT 
GTACAACTGC 
CGTGTTGCGG 
GACTACTTCT 
AGTCTGTGGC 
TTACGTTTTC 
CAGTGGGCGT 
ACTGGGGAAG 
GTTCGTGTTG 
CGTGGCATTA 
TTGTACCTAA 
GGGATTACGC 
TTTGATACGC 



TTTTTCTTTG 
AGATTTCTCG 
CTGCAATAGG 
CTGATGATTG 
AGGGATAGAT 
AACGAAATAA 
TTTTATCGCT 
GGTTACATTG 
GGATCAGTCG 
TATCTGGATT 
AAGAGCGATG 
GGCAAATTCA 
TCTTATGTAA 
GTTGAGTAGT 
GAAGAATTTC 
ATACTGAATT 
TGAGTTTCCT 
ACAATAAAAT 
AGGGCTATTA 



TTATCTTTTC 50 

ACAGCAAAGA 100 

TACTCCATTA 150 

ATAGTGGCAC 200 

CCAGAGGAAG 2 50 

TTTATATGTG 300 

TTGCTGATTT 3 50 

TCTGGTGACA 4 00 

TACGGGGATG 450 

TAATGTCGCA 500 

TTACGGTTTG 550 

GAGGGGATTT 600 

TGACTGCTGA 650 

GTCCTGCCTG 700 

TTTTGGAAGT 750 

GTCATCATCA 800 

TCTATGTGCC 850 

ATTGTGGGAC 900 

GCAGTTGA 948 



2) INFORMATION FOR SEQ ID NO: 1077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 

CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 

GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GACTTTTCGA 100 

CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 

ACCCCTCTTG AACATATATC TCAGGGGACC ACATCGGTGT CTGTTATTAA 200 

CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 

611 
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TCTATCAGGC GCGTTTTGAC 
TTATATGTGG CCGGGTTCGT 
TTCAGATTTT ACACATATAT 
CAACGGACAG CAGTTATACC 
5 TCCGGAATGC AAATCAGTCG 
AATGGAGTTC AGTGGTAATA 
TGCGTTTTGT CACTGTCACA 
AGAGAATTTC GTCAGGCACT 
GCCGGGAGAC GTGGACCTCA 

10 TTCCGGAGTA TCGGGGAGAG 
AATAATATAT CAGCGATACT 
TCATCAGGGG GCGCGTTCTG 
AATGTCAGAT AACTGGCGAC 
TGGGAAAGTA ATACAGCTGC 

15 ATATACAACG GGTAAATAAA 
GGTTTTATTT GCATTAGTTT 
AAGGTAAAAT TGAGTTTTCC 
AAGGTTGACG GGAAAGAATA, 
ACTGCAAAGT GCTCAGCTGA 

20 CCTGTGAATC AGGCTCCGGA 
GGCATAACC 



CATCTTCGGC TGATTATTGA 
TAATACGGCA ACAAATACTT 
CAGTGCCCGA TGTGACAACG 
ACTCTGCAAC GTGTCGCAGC 
TCACTCACTG GTTTCATCAT 
CAATGACCAG AGATGCATCC 
GCAGAAGCCT TACGCTTCAG 
GTCTGAAACT GCTCCTGTGT 
CTCTGAACTG GGGGCGAATC 
GATGGTGTCA GAGTGGGGAG 
GGGTACTGTG GCCGTTATAC 
TTCGCGCCGT GAATGAAGAG 
AGGCCCGTTA TAAAAATAAA 
AGCGTTTCTG AACAGAAAGT 
GGAGTTAAGT ATGAAGAAGA 
CTGTTAATGC AATGGCGGCG 
AAGTATAATG AGGATGACAC 
CTGGACCAGT CGCTGGAATC 
CAGGAATGAC TGTCACAATC 
TTTGCTGAAG TGCAGTTTAA 
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GCAAAATAAT 


300 


TCTACCGTTT 


350 


GTTTCCATGA 


400 


GCTGGAACGT 


450 


ATCTGGCGTT 


500 


AGAGCAGTTC 


550 


GCAGATACAG 


600 


ATACCATGAC 


650 


AGCAATGTGC 


700 


AATATCCTTT 


750 


TGAATTGCCA 


800 


AGTCAACCAG 


850 


CAATACATTA 


900 


CACAGTTTTT 


950 


TGTTTATGGC 


1000 


GATTGTGCTA 


1050 


ATTTACAGTG 


1100 


TGCAACCGTT 


1150 


AAATCCAGTA 


1200 


TAATGACTGA 


1250 




1259 



25 2) INFORMATION FOR SEQ ID NO: 1078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078 

AGTTCTGCGT TTTGTCACTG TC 22 



40 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1079 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079 
CGGAAGCACA TTGCTGATT 19 

2) INFORMATION FOR SEQ ID NO: 1080 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 25 bases 

612 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
TATAGCTACT GTCACCAGAC AATGT 



2) INFORMATION FOR SEQ ID NO: 1081 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 
ATGTCAGAGG GATAGATCCA 



2) INFORMATION FOR SEQ ID NO: 1082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 
TTGARCRAAA TAATTTATAT GTG 



2) INFORMATION FOR SEQ ID NO: 1083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 
TGATGATGRC AATTCAGTAT 



613 
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2) INFORMATION FOR SEQ ID NO: 1084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 



2) INFORMATION FOR SEQ ID NO: 1085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 85 
CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 



2) INFORMATION FOR SEQ ID NO: 1086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 
CTACTCCCGC CTTTTGGGTT 



2) INFORMATION FOR SEQ ID NO: 1087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 
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2) INFORMATION FOR SEQ ID NO: 1088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ , ID NO: 1088 
TGCCGTTTCC TGTATCCGTC 



2) INFORMATION FOR SEQ ID NO: 1089 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089 
ATCCACACGG GCTAGACCTC 



2) INFORMATION FOR SEQ ID NO: 1090 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090 
AATAGCGCGG ACGAATTGGA C 



2) INFORMATION FOR SEQ ID NO: 1091 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091 
5 AACGCGGCAC TGTTTCCCAA 20 



10 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1092 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 

20 

TCGGCAAGAC AATATGACAG C 21 



25 2) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa-165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 



TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

CGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACCTGCTT 200 

45 TTTTAAAGTA TAGGTATAAA ATACAATTGA TTAAAATAGT AAAGGAAATG 250 

AATCATGAAA CAATTAACTA AGCCTTTATA CTTTTACCTA TTACTTTTTA 300 

TTACAACAAC ACTGATTGGC GCGTTACTAT TATATTTGCC AATCACAGGT 350 

AAACATCCTA TTGATTTTGT GGACGCCCGT T 381 



2) INFORMATION FOR SEQ ID NO: 1094 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094 
GGTAAAACAG GTACTTCTAA CTA 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1095 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095 
CGATAGAAGC AGCAGGACAA 



2) INFORMATION FOR SEQ ID NO: 1096 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096 
CTGATGGATG CGGAAGATAC 



2) INFORMATION FOR SEQ ID NO: 1097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097 
TCYTCAAAAG GGATCACWAA AGTMAC 



2) INFORMATION FOR SEQ ID NO: 1098 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
TCTTCAAAAT CGAAAAAGCC GTC 23 

2) INFORMATION FOR SEQ ID NO: 1099 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:, Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 
TCAAAAGGGA TCACWAAAGT MAC 23 

25 

2) INFORMATION FOR SEQ ID NO: 1100 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 
40 GTAAAKCCCG GCATRGTRTT GATTTC 26 



2) INFORMATION FOR SEQ ID NO: 1101 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 
GACGGYTTTT TYGATTTTGA AGA 23 



60 2) INFORMATION FOR SEQ ID NO: 1102 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102 
AAAAARTCGA TKCGAGCMAG ACC 



2) INFORMATION FOR SEQ ID NO: 1103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 



2) INFORMATION FOR SEQ ID NO: 1104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
GGATCAACAC AGTAGAACCG 



2) INFORMATION FOR SEQ ID NO: 1105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105 
CTCCTACGAT TCTCTTGAYA AATCA 
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2) INFORMATION FOR SEQ ID NO: 1106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: , SEQ ID NO: 1106 
CAACCGATCT CAACACCGGC AAT 



2) INFORMATION FOR SEQ ID NO: 1107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 
CTCATTTGAC TTCCTCCTTT GCT 



2) INFORMATION FOR SEQ ID NO: 1108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 



2) INFORMATION FOR SEQ ID NO: 1109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
ACATCGTGAT CGCTAAAAGG AGC 23 

5 

2) INFORMATION FOR SEQ ID NO: 1110 

L0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear. 

L5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
20 ACGAGAAAGA CAACAGGAAG ACC 23 



2) INFORMATION FOR SEQ ID NO: 1111 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111 

35 

CTTTTTCCGG CTCGWYTTCC TGATG 25 



10 2) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112 

GGCTGYGATA TTCAAAGCTC 20 



55 



2) INFORMATION. FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 20 



2) INFORMATION FOR SEQ ID NO: 1114 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 
TCWGAGCCTT TTTCCGGCTC G 21 

25 

2) INFORMATION FOR SEQ ID NO: 1115 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 
40 TTTCGGGCTG TGAGGTCGGB TGHGCG 26 



2) INFORMATION FOR SEQ ID NO: 1116 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 

55 

TTTCGGGCTG TGAGGTCGGB TGHGCGG 27 



60 2) INFORMATION FOR SEQ ID NO: 1117 
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L0 



L5 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: 94 

(C) ACCESSION NUMBER: U94526 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 



AAATTCGATC 
ATGCAAGAAG 
TCTCCCCGGA 
TACGAAACAC 
CGGGGAGGAT 
ATGTGGGCTG 
GCCTACATTC 
GATTGATAAA 
TTGTGAAGCC 
GGTACGGAAG 
AAAAATCTTA 
TCATGGGGAA 
CTGAGCCACG 
CTCAGAAAAT 
GAAATCGGGT 
AGAGGGCTTG 
T 



CGCACTACAT 
CCATGTACGG, 
TAGGAAAACG 
GGCGTATTGA 
GGTGCGATAC 
TGATATTCAA 
TTACAAAAAA 
GGTGACAAGC 
GGCACGGTCA 
AACTTAACGC 
ATTGAGCAAG 
CGAGGATGAT 
GTATCTTCCG 
GCGATGATTA 
GCAGGAAACG 
CCCGTGTTGA 



CGGAATTACA 
AATGGGAAGC 
CATGGGCTGC 
TGTGGCTTTC 
AGGGGCTGTT 
AGCTCCGCAG 
TGCGGGCATC 
CGGAGGCGGG 
GGTTCGTCCT 
TGCGATAGAA 
CGATTTCGGG 
TTGATTGTCG 
CATCCATCAG 
CAGTTCCCGC 
GCAAAGAAAG 
TCTTTTTTTG 



AAAAACGGTG 
CGACAGTCTC 
TTGTCATGAA 
CCGGTTTTGC 
TGTATTGTCT 
CTTGCATGGA 
GCCGTTCCCG 
TGCGCTTACC 
TTGGCGTAAC 
GCGGCAGGAC 
CTGTGAGGTC 
GCGAAGTGGA 
GAAAACGAGC 
AGACATTCCG 
TATATCGGGT 
CAGGAGGATG 



TATGGAAGCT 
CCCGCCATAC 
AGAAAGCGAA 
ATGGCAAATG 
GGTATCCCCT 
CAAATCACTG 
AATTTCAAAT 
TACCCTGTCT 
CAAAGTAAAC 
AATATGATGG 
GGGTGTGCGG 
TCAAATCCGG 
CGGAAAAAGG 
GTCGAGGAAC 
GCTTGGATGC 
GCGGCATCGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
801 



35 



ao 



35 



2) INFORMATION FOR SEQ ID NO: 1118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG 



24 



2) INFORMATION FOR SEQ ID NO: 1119 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 
TTTCGGGCTG TGAGGTCGGB TGHGC 



2) INFORMATION FOR SEQ ID NO: 1120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120 
TTTCGGGCTG TGAGGTCGGB TGHG 



2) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid ' 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121 
TGTTTGWATT GTCYGGYATC CC 



2) INFORMATION FOR SEQ ID NO: 1122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single • 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
TGGTGCATTG CTACGTGG 



2) INFORMATION FOR SEQ ID NO: 1123 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic. acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123 
TTTCGGGCTG TGAGGTCGGB TG 22 

2) INFORMATION FOR SEQ ID NO: 1124 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124 
25 GATTTGRTCC ACYTCGCCRA CA 22 



30 



2) INFORMATION FOR SEQ ID NO: 1125 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125 

40 

ACTCACAACT GGGATGGATG 2 0 

45 2) INFORMATION FOR SEQ ID NO: 1126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126 

TTATGGTTGT GCTGGTTGAG G 21 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127 
KCAAAYGCCA TTTCAAGTAA 



2) INFORMATION FOR SEQ ID NO: 1128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128 
GACGACYTTA TKGATATACA 



2) INFORMATION FOR SEQ ID NO: 1129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129 
ATGATGACHG AMATGATGAA AAC 



2) INFORMATION FOR SEQ ID NO: 1130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130 

626 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
GACTATCCAA GCATGCATTA TG 



PCT/CA00/01150 

22 



5 2) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 22 



20 



30 



35 
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2) INFORMATION FOR SEQ ID NO: 1132 



(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132 
AGTGAAAARA TGGCTGCTGC 20 

2) INFORMATION FOR SEQ ID NO: 1133 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY; Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133 
CATCAAGAAC ACTGGCTAYG TAG 23 

50 

2) INFORMATION FOR SEQ ID NO: 1134 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 
CTAGATAGAG CTAAAACCTT CCT 



2) INFORMATION FOR SEQ ID NO: 1135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 
CATTATGCAA ACGCCATTTC AAG 



2) INFORMATION FOR SEQ ID NO: 1136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136 
ACTTGTCCAC GTTSGATRTC T 



2) INFORMATION FOR SEQ ID NO: 1137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 



2) INFORMATION FOR SEQ ID NO: 1138 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1032 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138 

ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT 50 

GTCACTAACC TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT 100 

ATGAAGTAAT GACCATTGGC ATCGCACCAA CAATGGATTG GTATTGGTAT 150 

CAAGGAAACC TCGCGAATGT TCGCAATGAT ACTTGGCTAG AAGATCACAA 200 

AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA TTAGGAGAAA 250 

AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 300 

GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT 350 

TGGTTGCCAT GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC 400 

ATCAACTTGC TGATACCATG GGAATCGCTA GTGCTCCCAC TTTGCTTTTA 450 

TCCCGCTATG AAAACGATCC TGCCACAATC GATCGTTTTA TTCAAGACCA 50 0 

TGGATTCCCG ATCTTTATCA AGCCGAATGA AGCCGGTTCT TCAAAAGGGA 550 

TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 600 

TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA 650 

AATTGGCTGC GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG 700 

ATGCGATTTC TCTTGTCGAC GGTTTTTTTG ATTTTGAAGA GAAATACCAA 750 

TTAATCAGCG CCACGATCAC TGTCCCAGCA CCATTGCCTC TCGCGCTTGA 800 

ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC TTGGGATTGA 850 

CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 900 

TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC 950 

AGCTATGATG GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT 1000 

TGATTGCACT GGCAGAGGAG GACAAACGAT GA 1032 



2) INFORMATION FOR SEQ ID NO: 1139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS; Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 9 

GATATCGTTA CGCTTCATGT GCCGCTCAAT ACGGATACGC ACTATATTAT 50 

CAGCCACGAA CAAATACAGA GAATGAAGCA AGGAGCATTT CTTATCAATA 100 

CTGGGCGCGG TCCACTTGTA GATACCTATG AGTTGGTTAA AGCATTAGAA 150 

AACGGGAAAC TGGGCGGTGC CGCATTGGAT GTATTGGAAG GAGAGGAAGA 200 

GTTTTTCTAC TCTGATTGCA CCCAAAAACC AATTGATAAT CAATTTTTAC 250 

TTAAACTTCA AAGAATGCCT AACGTGATAA TCACACCGCA TACGGCCTAT 300 

TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 350 
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TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 4 00 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 4 50 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

5 TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 600 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 700 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 800 

LO ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG CTACGTTTAC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 950 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

15 GTTAGTTGTT GGCGAGGTGG ACCAAATCAG ,GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGACCTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 1200 

GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 1250 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 13 00 

20 CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 1350 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1400 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 1500 

AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

2 5 GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 1600 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAACTGTTTT 1650 

ATGCAATGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 1700 

TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 1768 

30 



2) INFORMATION FOR SEQ ID NO: 1140 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
45 (C) ACCESSION NUMBER: L29638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 0 



GTAAGAATCG GAAAAGCGGA 

50 ATTTTTGGAG GCAATTCACC 

TAGCGCAATC GAAGCACTCC 

TGATCGGGAT CGCCCCAGAT 

CTGGAAAACA TCCGACAAGA 

GAAAATACAG CCGCTATTCG 

55 AGCAAACGTT GGTACCTGAT 

GGGGAAGATG GCAGTATCCA 

TGTAGGCTGC GGGGTGGCAG 

TGCATCAAGC TGCAGCAGCC 

TTGACAAATC AAGCCAACCA 

60 CCATGGCTTC CCAGTTTTCT 



AGGAAGAAAA ACATGAAAAA 
GGAATACACC GTTTCTTTAG 
AATCATCTCC CTATGACTAC 
GCTATGGATT GGTACTTGTA 
CACGTGGTTG TTGGATACGA 
AAGGAAACGG CTTTTGGCTA 
GTTTTATTTC CCATTATGCA 
AGGATTGTTT GAATTGATGA 
GTTCTGCCTT ATGTATGAAC 
ATTGGCGTAC AAAGTGCTCC 
GCAAGAACAA ATCGAAGCTT 
TTAAGCCTAA TGAAGCGGGC 
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AATCGCCATT 


50 


CTTCAGCAAC 


100 


GACCTCTCTT 


150 


TACAGGAGAA 


200 


AACATAAACA 


250 


AGTGAAGAGC 


300 


TGGCAAATAC 


350 


AGCTGCCTTA 


400 


AAATGGCTGC 


450 


TACGATTCTC 


500 


TTATCCAGAC 


550 


TCCTCAAAAG 


600 
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GGATCACTAA AGTCACCTGC 
GCCTTTACTT ATTGTTCCGC 
TGAGATCGGT TGCGGTATTT 
GTGACGCCAT TTCATTAGTA 
5 CAGCTGATCA GCGCCAAAAT 
TGAAACCAAG GTCAAAGAAC 
TTAAAGGTCT TGCTCGCATC 
TACTTGAATG AAATCAATAC 
TCCTGCCATG ATGGCAGCGG 
LO AACTGCTTGT CTTAGCAAAG 



GTTGAAGAAA TCGCTTCTGC 
AGTGCTCCTA CAAAAAAATA 
TGGGCAACGA CTCTTTGACT 
GACGGCTTTT TCGATTTTGA 
CACCGTCCCT GCGCCATTGC 
AAGCTCAGCT GCTCTATCGT 
GACTTTTTTG TCACGGAGCG 
TATGCCGGGC TTTACGAGTC 
TCGGCTTATC CTATCAAGAA 
GAGGAAGTCA AATGAG 



PCT/CA0O/0J150 

CTTAAAAGAA 650 

TTGCCGGTGT 700 

GTCGGTGCTT 750 

AGAAAAGTAC 800 

CTGAAACGAT 850 

AGTCTTGGTC 900 

AGGAGAACTA 950 

ACTCCCGCTA 1000 

CTACTACAAA 1050 
1086 



2) INFORMATION FOR SEQ ID NO: 1141 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
25 (B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M97297 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141 

30 ATGAATAACA TCGGCATTAC TGTTTATGGA TGTGAGCAGG ATGAGGCAGA 50 

TGCATTCCAT GCTCTTTCGC CTCGCTTTGG CGTTATGGCA ACGATAATTA 100 

ACGCCAACGT GTCGGAATCC AACGCCAAAT CCGCGCCTTT CAATCAATGT 150 

ATCAGTGTGG GACATAAATC AGAGATTTCC GCCTCTATTC TTCTTGCGCT 200 

GAAGAGAGCC GGTGTGAAAT ATATTTCTAC CCGAAGCATC GGCTGCAATC 2 50 

3 5 ATATAGATAC AACTGCTGCT AAGAGAATGG GCATCACTGT CGACAATGTG 3 00 

GCGTACTCGC CGGATAGCGT TGCCGATTAT ACTATGATGC TAATTCTTAT 350 

GGCAGTACGC AACGTAAAAT CGATTGTGCG CTCTGTGGAA AAACATGATT 4 00 

TCAGGTTGGA CAGCGACCGT GGCAAGGTAC TCAGCGACAT GACAGTTGGT 450 

GTGGTGGGAA CGGGCCAGAT AGGCAAAGCG GTTATTGAGC GGCTGCGAGG 500 

4 0 ATTTGGATGT AAAGTGTTGG CTTATAGTCG CAGCCGAAGT ATAGAGGTAA 550 

ACTATGTACC GTTTGATGAG TTGCTGCAAA ATAGCGATAT CGTTACGCTT 600 

CATGTGCCGC TCAATACGGA TACGCACTAT ATTATCAGCC ACGAACAAAT 650 

ACAGAGAATG AAGCAAGGAG CATTTCTTAT CAATACTGGG CGCGGTCCAC 700 

TTGTAGATAC CTATGAGTTG GTTAAAGCAT TAGAAAACGG GAAACTGGGC 750 

45 GGTGCCGCAT TGGATGTATT GGAAGGAGAG GAAGAGTTTT TCTACTCTGA 800 

TTGCACCCAA AAACCAATTG ATAATCAATT TTTACTTAAA CTTCAAAGAA 850 

TGCCTAACGT GATAATCACA CCGCATACGG CCTATTATAC CGAGCAAGCG 900 

TTGCGTGATA CCGTTGAAAA AACCATTAAA AACTGTTTGG ATTTTGAAAG ' 950 

GAGACAGGAG CATGAATAGA ATAAAAGTTG CAATACTGTT TGGGGGTTGC 1000 

50 TCAGAGGAGC ATGACGTATC GGTAAAATCT GCAATAGAGA TAGCCGCTAA 1050 

CATTAATAAA GAAAAATACG AGCCGTTATA CATTGGAATT ACGAAATCTG 1100 

GTGTATGGAA AATGTGCGAA AAACCTTGCG CGGAATGGGA AAACGACAAT 1150 

TGCTATTCAG CTGTACTCTC GCCGGATAAA AAAATGCACG GATTACTTGT 1200 

TAAAAAGAAC CATGAATATG AAATCAACCA TGTTGATGTA GCATTTTCAG 1250 

55 CTTTGCATGG CAAGTCAGGT GAAGATGGAT CCATACAAGG TCTGTTTGAA 1300 

TTGTCCGGTA TCCCTTTTGT AGGCTGCGAT ATTCAAAGCT CAGCAATTTG 1350 

TATGGACAAA TCGTTGACAT ACATCGTTGC GAAAAATGCT GGGATAGCTA 14 00 

CTCCCGCCTT TTGGGTTATT AATAAAGATG ATAGGCCGGT GGCAGCTACG 14 50 

TTTACCTATC CTGTTTTTGT TAAGCCGGCG CGTTCAGGCT CATCCTTCGG 1500 

60" TGTGAAAAAA GTCAATAGCG CGGACGAATT GGACTACGCA ATTGAATCGG 1550 
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CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 1600 

GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 1650 

GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 1700 

TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 1750 

CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 1800 

TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 1850 

ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 1900 

TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 1950 

CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 2000 

GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 2050 

ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 2100 

TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AATCGCTTTT 2150 

GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 2200 

ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 2250 

CAGCCGGAAA ATAACCTGAC AAAGGAAAGT, TATTATCCCA ATATTGACCG 2300 

AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 2350 

GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 24 00 

GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 24 50 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 2500 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 2550 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2600 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 2650 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 2700 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 27 50 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 2800 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 2850 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 2900 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 2950 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 30 00 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 3050 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 3100 

GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA 3150 

TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 32 00 

AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 3250 

AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 33 00 

ATCTAAACAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 3350 

TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 34 00 

GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 34 50 

CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 3500 

TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 3550 

TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 3600 

AAATGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 3650 

AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 37 00 

CCACATAGTG CGATTATGAA AGAAAAGAAT TTCGTTCTCG AGGAATATAT 3750 

GGATTACCTA AAAGAAGAAA AAACCATTTC TGTTAGTGTA AATGGGGAAA 38 00 

AATATGAGAT CTTTTATTAT CCTGTTACTA AAAATACCAC CATTCATGTG 3850 

CCGACTAATC TTCGTTATGA GATATCAGGA AACAATATAG ACGGTGTAAT 3900 

TGTGACAGTG TTTCCCGGAT CAACACATAC TAATTCAAGG AGGTAA 3946 



2) INFORMATION FOR SEQ ID NO: 1142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142 
GATCCTCTAA ATGATTCTCA GGTGG 
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2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143 
CAATTAGCTT AGCAATAGGT GTTGG 



2) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144 
TGTYTTCCAA GGTTCAGCTC 



2) INFORMATION FOR SEQ ID NO: 1145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
AACATATTKG GTTGATAGGT 



2) INFORMATION FOR SEQ ID NO: 1146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 6 
GGGATTACCT ATGCCAATAT GAT 



2 ) INFORMATION FOR SEQ ID NO: 1147 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 
AGCTGTGTTA GCVCGAACAT CTTG 



2) INFORMATION FOR SEQ ID NO : 1148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
GACTTTGTTT GGCGTGATAT 



2) INFORMATION FOR SEQ ID NO: 1149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 9 
TCCYACWATT TCTTTTTGWG 



2) INFORMATION FOR SEQ ID NO: 1150 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 
TGATAATCAC ACCGCATACG 



2) INFORMATION FOR SEQ ID NO: 1151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151 
TGCTGTCATA TTGTCTTGCC 



2) INFORMATION FOR SEQ ID NO: 1152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152 
ATAAAGATGA TAGGCCGGTG 



2) INFORMATION FOR SEQ ID NO: 1153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 
CTCGTATGTC CCTACAATGC 
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2) INFORMATION FOR SEQ ID NO: 1154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 
GTTTGAAGCA TATAGCCTCG 



2) INFORMATION FOR SEQ ID NO: 1155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 
CAGTGCTTCA TTAACGTAGT C 



2) INFORMATION FOR SEQ ID NO: 1156 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Li near 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 
GTTGAAATGC ATCACGAACA ATT 



2) INFORMATION FOR SEQ ID NO: 1157 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
AAGAACGTTT CAGTTAAGGA AAT 
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2) INFORMATION FOR SEQ ID NO: 1158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 
AAGAGGTAAT GTCTGTGGT 



2) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159 
TGAAGGTTTG CCAGGTGA 



2) INFORMATION FOR SEQ ID NO: 1160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 



2) INFORMATION FOR SEQ ID NO: 1161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 
TCCAGGTGAT AACGTTGG 



2) INFORMATION FOR SEQ ID NO: 1162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162 
CAAGTCCGTG GAAATGCA 



2) INFORMATION FOR SEQ ID NO: 1163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163 
GTTGGTTTCA ACGTTAAGAA C 



2) INFORMATION FOR SEQ ID NO: 1164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 
GGTTTCAACG TCAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 
GGTTTCAACG TGAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166 
ACGTTAAGAA TGTTTCTGTC AA 



2) INFORMATION FOR SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167 
GTTGGTTTCA ACGT 



2) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168 
GAACAATTGG TTGAAGGTGT 

639 
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2) INFORMATION FOR SEQ ID NO: 1169 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AF139883 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169 



ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

25 TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

30 GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 7 00 

35 CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

4 0 TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 1200 

4 5 TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 1250 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 1300 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 14 00 

TAATACAACA GAATCTAATA AACAATACGG AGCAAGTAGT GAAAAAATGG 1450 

50 CTGCTGCTTA TGCTGCCTTT GCAAATGGTG GCACTTACTA TAAACCAATG 1500 

TATATCCATA AAGTCGTCTT CAGTGATGGA AGTAAAAAAG AGTTCTCTAA 1550 

TGTCGGAACT CGTGCCATGA AGGAAACGAC AGCCTATATG ATGACCGACA 1600 

TGATGAAAAC AGTCTTGACT TATGGAACTG GGCGTGGAGC CTATCTTCCT 1650 

TGGCTTCCTC AAGCTGGTAA AACAGGAACC TCTAACTATA CAGATGAGGA 1700 

55 AGTTGAAAAC CACATCAAGA ACACTGGCTA TGTAGCTCCA GATGAAATGT 1750 

TTGTTGGTTA TACTCGTAAG TATTCTATGG CTGTATGGAC AGGTTATTCG 1800 

AATCGTTTAA CTCCTATCGT TGGAGATGGT TTCCTAGTTG CAGCTAAAGT 1850 

TTATCGCTCA ATGATAACGT ATCTATCAGA AGATACTCAT CCAGAAGACT 1900 

GGACGATGCC AGACGGACTT TTCAGAAACG GGGAATTTGT ATTCAAAAAT 1950 

60 GGAGCTCGCC CAATATGGAC TGAACCCTCT ACTCAACAAT CCTCAACAGC 2000 
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ACAACCATAA 



PCT/CA00/01150 

AACTTCACAG TCTAGCTCAA 2050 
CCAATCCTAA CAATAATACG 2100 
AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 



2) INFORMATION FOR SEQ ID NO: 1171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171 
ACGAGGATGA TTTGATTGTC 



2) INFORMATION FOR SEQ ID NO: 1172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE. TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: X13136 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172 

GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGAAAG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGCTATT CTCTAAATGA 200 
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CCGTGTAGGA ACCTCCTATT TGGAAAAGCA ATATGAAGAG ACCTTACAAG 250 

GAAAACGCTC GGTAAAAGAA ATCCATCTGG ACAAATATGG CAACATGGAA 300 

AGCGTGGATA CAATTGAGGA AGGTAGTAAG GGAAACAATA TCAAGCTGAC 350 

CATTGATTTG GCCTTCCAAG ATAGCGTGGA TGCTTTGCTG AAAAGTTATT 400 

5 TCAATTCAGA GTTGGGAAAT GGTGGAGCCA AGTATTCTGA AGGTGTCTAT 450 

GCAGTCGCCC TTAACCCAAA AACAGGTGCT GTTTTGTCTA TGTCAGGAAT 500 

TAAACATGAC TTGAAAACAG GAGAGTTGAC GCCGGATTCC TTGGGAACGG 550 

TAACCAATGT CTTTGTCCCA GGTTCGGTTG TCAAAGCAGC GACCATCAGC 600 

TCTGGTTGGG AAAATGGAGT CTTGTCAGGA AATCAGACCT TGACAGACCA 650 

LO* GTCCATTGTC TTTCAAGGTT CAGCTCCAAT TAATTCTTGG TATCCTGCCT 700 

TTTCTAGACC AATGCCGATT ACGGCGGTTC AGGCTCTAGA GTATTCATCC 750 

AATGCTTATA TGGTCCAAAC AGCCCTAGGT CTTATGGGGC AGACCTATCA 800 

ACCCAATATG TTTGTCGGCA CCAGCAATCT AGAGTCTGCT ATGGGGAAAT 850 

TGCGTTCAAC CTTTGGTGAA TATGGTTTGG GTTCTGCGAC CGGAATTGAC 900 

L5 CTACCAGATG AATCTACTGG ATTTGTTCCC , AAAGAGTATA GCTTTGCTAA 950 

TTTCATTACC AATGCCTTTG GGCAGTTTGA TAACTATACG CCGATGCAGT 1000 

TGGCTCAGTA TGTAGCAACT ATTGCAAATG ATGGTGTTCG TGTGGCTCCT 1050 

CGTATTGTTG AAGGCATTTA TGGTAATAAT GATAAGGGAG GACTGGGTGA 1100 

CTTGATTCAG CAACTGCAAC CGACAGAGAT GAATAAGGTC AATATA TCCG 1150 

20 ACTCCGATAT GAGCATCTTG CACCAAGGTT TTTATCAGGT TGCCCATGGT 1200 

ACTAGTGAAT TGACAACTGG ACGTGCCTTT TCAAATGGCG CCTTGGTATC 1250 

CATTAGCGGA AAAACAGGTA CAGCCGAAAG CTATGTGGCA GATGGTCAGC 1300 

AAGCAACCAA TACCAATGCG GTGGCCTATG CCCCATCTGA TAATCCCCAA 1350 

ATCGCTGTAG CTGTTGTCTT CCCTCATAAC ACCAACCTTA CAAATGGTGT 1400 

25 CGGACCTTCC ATTGCGCGCG ATATTATCAA CCTCTATAAC CAACATCATC 1450 

CAATGAATTA GAAAGGAACA TATGCTTTAT CCAACACCTA TTGCTAAGCT 1500 

AATTGACAGT TATTCGAAGT TACCGGGTAT CGGGATTAAG ACGGCTACCC 1550 

GTTTGGCCTT 1560 



30 



2) INFORMATION FOR SEQ ID NO: 1173 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2 007 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

45 (C) ACCESSION NUMBER: 249096 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173 

GAGTCCCGAT TGCTGAGGAT GCAACCTCCT ATAATGTCTA TGCGGTCATT 50 

50 GATGAGAACT ATAAGTCAGC AACGGGTAAG ATTCTTTACG TAGAAAAAAC 100 

ACAATTTAAC AAGGTTGCAG AGGTCTTTCA TAAGTATCTG GACATGGAAG 150 

AATCCTATGT AAGAGAGCAA CTCTCGCAAC CTAATCTCAA GCAAGTTTCC 200 

TTTGGAGCAA AGGGAAATGG GATTACCTAT GCCAATATGA TGACTATCAA 250 

AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 300 

55 CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGTCTA 350 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTGC TGGGAACTTC 400 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGCATTA 450 

TTACCTATGA AAAGGATCGT CTGGGCAATA TTGTACCCGG AACAGAACTG 500 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 550 

60 TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 600 
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TAAAAGGTAA 
ATTCTCGCTA 
AATCACTGAG 
AACCAGGATC 
AATACCTTCC 
GGATGCGACG 
GGATGATGAC 
AGTCTACTTG 
ACGCTTTAAA 
CTGGTCAACT 
GGGCAAGGAA 
TATTGCTAAT 
ATGATACTAA 
AATCCTGTTT 
AGTTGGGACG 
AGCCAATTAT 
GCTCAAATCG 
TTATATTTTC 
TCTTGTATGT 
GGAGAATTTG 
TCTCAATCTT 
CTTCTTATGC 
GAAGCCTTAC 
AAAGATTAAA 
AACAAGTTCT 
AGCTGGAAAA 
ACTGGAATTT 
CTAATACAGC 
TAATATG 



GTATATGACC 
CCACCGAACG 
GACTTTGTTT 
AGCCTTTAAG 
CAAGTGGAGA 
ACTCGAGATT 
TTTCTCACAA 
AACAAAAAAT 
TTTGGGGTTC 
TCCAGCTGAT 
TTTCAGTGAC 
GATGGAGTTA 
CAATCAGTCT 
CCAAAGAGGC 
GACCCTCTAT 
AACAGTTCCT 
CTGATGAGAA 
TCAGTTGTGA 
AACGGTTCAA 
CCACCCCAAT 
CAATCTCCAG 
AATGCCTAGC 
GCCGAAATAT 
GAGACTTCTG 
CCTTTTATCG 
AAGAGACTGC 
GAAGGTTCAG 
TATCAAAAAC 



GCGACCTTGG 
ACCTACCTTT 
GGCGTGATAT 
GTCATGATGT 
ATACTTCAAT 
GGGATGTTAA 
GGTTTCGCTC 
GGGAGATGCT 
CAACTCGCTT 
AATATTGTTA 
ACAAACACAA 
TGCTGGAGCC 
GTACGTAAGT 
AGCAAGCACA 
ATGGAACTAT 
GGACAAAATG 
AAATGGAGGA 
CTATGAATCC 
CAGCCTGAGC 
CTTGGAGCGG 
CCAAAAATTT 
ATCAAGGATA 
TGTGCAACCA 
TAGAAGAAGG 
GATAAGGTAG 
CGAGACCTTT 
GTTCCGTTGT 
ATTAAAAAAA 



TCAGTGCAAA 
AATGCAGATA 
TCTTTATCAA 
TAGCTTCTTC 
AGCAGTGAAT 
TGAGGGTTTG 
ACTCCAGTAA 
ACTTGGTTGG 
TGGCTTGACA 
GTATTGCTCA 
ATGCTTCGTG 
AAAATTTATA 
CACAAAAAGA 
ACTCGAAATC 
GTATAATCAC 
TAGCAGTTAA 
TACTTGGTTG 
TGCTGAAAAT 
ATTATTCAGG 
GCTTCAGCTA 
AGATAAAGTT 
TTTCACCTGG 
ATCGTTGTAG 
GACCAATCTT 
AAGAAATTCC 
GCTAAATGGT 
TCAGAAGCAA 
TTAAATTAAC 
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GACCGGTGAA 650 

CTAAAGAAGG 700 

AGTAACTATG 750 

TATTGATAAT 800 

TCAAAATAGC 850 

ACTACTGGTG 900 

TGTTGGAACG 950 

ATTATCTAAA 1000 

GATGAATACG 1050 

AAGCTCATTT 1100 

CCTTTACAGC 1150 

AGTGCTATTT 12 00 

AATAGTAGGA 1250 

ACATGATCTT 1300 

TACACAGGAA 1350 

ATCCGGTACG 1400 

GTTCTACCAA 1450 

CCTGATTTTA 1500 

TATCCAGTTG 1550 

TGAAAGAATC 1600 

ACGACAGAAT 1650 

TGAGTTGGCG 1700 

GTACTGGAAC 1750 

GCACCAAACC 1800 

AGACATGTAT 1850 

TGGATATTGA 1900 

GATGTTCGGA 1950 

TTTAGGAGAC 2 000 
2007 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1174 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
GAACGTGGTG AAGTTCGC 



18 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 1175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 

643 
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GTTACTGGTG TAGAAATGTT C 
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2) INFORMATION FOR SEQ ID NO: 1176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1176 
TACTGGTGTA GAAATGTTC 1 



2) INFORMATION FOR SEQ ID NO: 1177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177 
GCTCAACAAG TTCCAGATTA 



2) INFORMATION FOR SEQ ID NO: 1178 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 56 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC8325 

(C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178 

ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT ACTACATTTG 100 

TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATGAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 200 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAATACTA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 300 

AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 
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AATAGTTTAG GCGTTAAAGA TATAAACATT CAGGATCGTA AAATAAAAAA 400 

AGTATCTAAA AATAAAAAAC GAGTAGATGC TCAATATAAA ATTAAAACAA 450 

ACTACGGTAA CATTGATCGC AACGTTCAAT TTAATTTTGT TAAAGAAGAT 500 

GGTATGTGGA AGTTAGATTG GGATCATAGC GTCATTATTC CAGGAATGCA 550 

5 GAAAGACCAA AGCATACATA TTGAAAATTT AAAATCAGAA CGTGGTAAAA 600 

TTTTAGACCG AAACAATGTG GAATTGGCCA ATACAGGAAC ACATATGAGA 650 

TTAGGCATCG TTCCAAAGAA TGTATCTAAA AAAGATTATA AAGCAATCGC 700 

TAAAGAACTA AGTATTTCTG AAGACTATAT CAACAACAAA TGGATCAAAA 750 

TTGGGTACAA GATGATACCT TCGTTCCACT TTAAAACCGT TAAAAAAATG 800 

LO GATGAATATT TAAGTGATTT CGCAAAAAAA TTTCATCTTA CAACTAATGA 850 

AACAGAAAGT CGTAACTATC CTCTAGAAAA AGCGACTTCA CATCTATTAG 900 

GTTATGTTGG TCCCATTAAC TCTGAAGAAT TAAAACAAAA AGAATATAAA # 950 

GGCTATAAAG ATGATGCAGT TATTGGTAAA AAGGGACTCG AAAAACTTTA 1000 

CGATAAAAAG CTCCAACATG AAGATGGCTA TCGTGTCACA ATCGTTGACG 1050 

L5 ATAATAGCAA TACAATCGCA CATACATTAA TAGAGAAAAA GAAAAAAGAT 1100 

GGCAAAGATA TTCAACTAAC TATTGATGCT AAAGTTCAAA AGAGTATTTA 1150 

TAACAACATG AAAAATGATT ATGGCTCAGG TACTGCTATC CACCCTCAAA 1200 

CAGGTGAATT ATTAGCACTT, GTAAGCACAC CTTCATATGA CGTCTATCCA 1250 

TTTATGTATG GCATGAGTAA CGAAGAATAT AATAAATTAA CCGAAGATAA 13 00 

10 AAAAGAACCT CTGCTCAACA AGTTCCAGAT TACAACTTCA CCAGGTTCAA 1350 

CTCAAAAAAT ATTAACAGCA ATGATTGGGT TAAATAACAA AACATTAGAC 14 00 

GATAAAACAA GTTATAAAAT CGATGGTAAA GGTTGGCAAA AAGATAAATC 1450 

TTGGGGTGGT TACAACGTTA CAAGATATGA AGTGGTAAAT GGTAATATCG 1500 

ACTTAAAACA AGCAATAGAA TCATCAGATA ACATTTTCTT TGCTAGAGTA 1550 

25 GCACTCGAAT TAGGCAGTAA GAAATTTGAA AAAGGCATGA AAAAACTAGG 1600 

TGTTGGTGAA GATATACCAA GTGATTATCC ATTTTATAAT GCTCAAATTT 1650 

CAAACAAAAA TTTAGATAAT GAAATATTAT TAGCTGATTC AGGTTACGGA 1700 

CAAGGTGAAA TACTGATTAA CCCAGTACAG ATCCTTTCAA TCTATAGCGC 1750 

ATTAGAAAAT AATGGCAATA TTAACGCACC TCACTTATTA AAAGACACGA 1800 

30 AAAACAAAGT TTGGAAGAAA AATATTATTT CCAAAGAAAA TATCAATCTA 1850 

TTAAATGATG GTATGCAACA AGTCGTAAAT AAAACACATA AAGAAGATAT 1900 

TTATAGATCT TATGCAAACT TAATTGGCAA ATCCGGTACT GCAGAACTCA 1950 

AAATGAAACA AGGAGAAAGT GGCAGACAAA TTGGGTGGTT TATATCATAT 2000 

GATAAAGATA ATCCAAACAT GATGATGGCT ATTAATGTTA AAGATGTACA 2050 

35 AGATAAAGGA ATGGCTAGCT ACAATGCCAA AATCTCAGGT AAAGTGTATG 2100 

ATGAGCTATA TGAGAACGGT AATAAAAAAT ACGATATAGA TGAATAACAA 2150 

AACAGTGAAG CAATCCGTAA CGATGGTTGC TTCACTGTTT TATTATGAAT 22 00 

TATTAATAAG TGCTGTTACT TCTCCCTTAA ATACAATTTC TTCATTTTCA 2250 

TTGTATGTTG AAAGTGACAC TGTAACGAGT CCATTTTCTT TTTTTATGGA 23 00 

4 0 TTTCTTATTT GTAATTTCAG CGATAACGTA CAATGTATTA CCTGGTATAC 2350 

AGTTTAATAA ATTTAACGTT ATTCATTTGT GTTCCTGCTA CAACTTCTTC 24 00 

TCCGTATTTA CCTTCTTCTA CCCATAATTT AAATGATATT GAAAGTGTAT 2450 

GCATGC 24 56 

45 



2) INFORMATION FOR SEQ ID NO: 1179 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179 
ATTTGGTGAC GGGTGACTTT 2 0 

60 
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2) INFORMATION FOR SEQ ID NO: 1180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
15 TCCACCGTTG CCAATCGCA 19 



2) INFORMATION FOR SEQ ID NO: 1181 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181 

30 

AGCAGCTTAC TAGATGCCGT 20 



35 2) INFORMATION FOR SEQ ID NO: 1182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 

AACTGCAAGA GATCCTTTGG 20 



2) INFORMATION FOR SEQ ID NO: 1183 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2535 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

646 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 
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(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1183 



ATGGCGATAG 

10 AAAGCAATAT 
AATTATTTTA 
TTAACGAGTC 
TCCCTATCAT 
ATAAGGTGGC 

15 GTTGTTAAAC 
TAGCAGTAAG 
AAGGCAATCA 
TATGTGACAG 
CCTCAAGGCT 

20 AACAAATCCT 
AGCTTTGAAG 
AACGGCATCT 
AATTGAACCA 
CAGATGGATT 

2 5 CTCAGGTAAG 
CGGCTATGGG 
GATAAGGAAC 
CCATTTCTTT 
ACATTGAGCG 

30 GATCTCTTGC 
GATTTTAGAA 
TGGATGCAAT 
GAAGCTCCTC 
TGAGACTTTA 

35 TTGCTGAGAT 
AAGATTGACT 
GCAACTGGGA 
ACTCAGAACG 
CTTGAGGCGC 

4 0 CATTCGTGAA 
AAGGAATTGC 
ACCCAGCATT 
CCGGAAAGGG 
ATATTCCAAA 

45 ACAGGGCCAA 
GACGGCGGTT 
ATTTACCGAT 
TTGGTTTCGG 
TGCCATTTCG 

50 GACGTGGAAC 
GAATATATCC 
CCATGAGTTG 
ACGTGGCAAC 
GAACCGGGAC 

55 TGGCTTGCCA 
TAGAGAATCA 
GTCACTGAAC 
AGCAGAATTA 
TGAATGTCTT 

60 



AAAAGCTATC 
CCAGATGCTT 
TGAGGATGCG 
GCAACAAGAA 
TCTGCCCAAC 
TATCGCAGAG 
GAGAGGTTGT 
CCGGACAGTC 
ATTTGGCCTA 
GTCTTTTGGA 
CGAGAAGTGG 
CAGCCGCCAG 
ACCTTCATTT 
AGTAAGCTGC 
CCTCAAACCT 
ATGCGACCAA 
AAACAAGGCA 
GATGCGTCTC 
GAATCGTCCA 
GAGCGTAGTG 
CTTGGCTAGT 
AGTTGGCGAC 
GGGATGGAGC 
CCCTGAGTTG 
ATGTGATTAC 
GACAAGTATC 
TGAGGCTAAG 
ACAATAAAAA 
AATGTGCCAG 
CTTTGGAACC 
GTGAGAAGTC 
GAGGTCGGCA 
GACGGTTGAT 
TGATTCGACC 
CGCCATGCTG 
TACGATTCAG 
ACATGAGTGG 
ATGGCCCAGC 
TTTTGATGCG 
GTCAGTCAAC 
CATGCGACCA 
TGCAACTTAT 
ATGAGCACAT 
ACTAGTCTGG 
TTTGGAGCAG 
CAGCTGATAA 
GCAGACCTTT 
AGGAACAGAG 
AGATTTCACT 
GCTAAACTGG 
AGTAGAGTTA 



ACCCGGCATG 
TTTTGCTCTT 
GTCAATGCTG 
TGCCGACAAT 
AGTATATCGA 
CAGATGGAAG 
TCAGGTCATT 
AGAATAATTT 
GCTTATATGG 
TTTCACGCTG 
TGTTGGGTTA 
ATGAATCTGG 
ATTGGATTTG 
TCCAGTATGT 
GTTATCCGCT 
GGCTAGTCTG 
GTCTTTTCTG 
TTGCGTTCTT 
ACGTCAAGAA 
ACTTGACAGA 
CGTGTTTCTT 
TACCTTGTCT 
AACCTACTCT 
GAGAGTTTGA 
AGATGGGGGA 
GTTGCGTTCT 
GAGCGAGAAA 
GGATGGCTAC 
CCCACTTTTT 
GAAGAATTAG 
AGCCAACCTC 
AGTACATCCA 
GTCTTACAGA 
TGAGTTTGGT 
TCGTTGAAAA 
ATGGCAGAAG 
GAAGTCTACC 
TGGGTTCCTA 
ATTTTTACCC 
CTTTATGGTG 
AGAACTCTCT 
GACGGGATGG 
CGGAGCTAAG 
AGTCTAGTTT 
GATGGGCAGG 
ATCCTACGGT 
TAGCAAGGGC 
AGTCCTCCTC 
CTTTGATAGG 
ATGTGTATAA 
AAACAGAAAC 



CAACAGTATG 
TCGGATGGGT 
CGCAGATTCT 
CCGATCCCTA 
TGTCTTGATT 
ATCCTAAACA 
ACGCCAGGGA 
TTTGGTTTCC 
ATTTGGTGAC 
GTTTGTGGGG 
TGACTTGTCT 
TACTCTCTTA 
CGATTGGCAA 
TCATCGGACT 
ACGAAATTAA 
GATTTGGTTG 
GCTTTTGGAT 
GGATTCATCG 
GTAGTGCAGG 
CAGTCTCAAG 
TTGGCAAAAC 
AGTGTGCCAC 
AGCCTATCTC 
TTAGCGCAGC 
ATTATCCGGA 
CAGAGAAGGG 
ACTCTGGTAT 
TATTTTCATG 
CCGCAAGGCG 
CCCGTATCGA 
GAATACGAAA 
GCGTTTACAA 
GTCTGGCGGT 
GACGATTCAC 
GGTTATGGGG 
ATACCAGTAT 
TATATGCGTC 
TGTTCCTGCT 
GTATCGGAGC 
GAGATGATGG 
CATTCTCTTT 
CTCTTGCTCA 
ACCCTCTTTG 
ACAACACTTG 
TCACCTTCCT 
ATCCATGTTG 
GGATAAGATT 
CCATGAGACA 
GCAGAAGAGC 
TATGACACCT 
TATAA 



TGGATATTAA 


50 


GATTTTTATG 

V*J^A J*. A, X X X a* X \J 


100 


GGAAATTTCC 


150 


TGGCGGGTGT 


200 


GAGCAGGGTT 


250 


AGCAGTTGGG 


300 


GAGTGGTCGA 


350 


ATAGACCGCG 


400 


GGGTGACTTT 


450 

"a *j \j 


AAATCCGTAA 


500 
~j \j \j 


GAGGAAGAAG 


550 


TGAAAAAGAA 


600 

U v v 


CGGTGGAGCA 


650 

\J J V 


CAGATGAGGG 


700 


GG A TTTCTTG 

Vj vj^i X X X X X \J 


750 


AGAATGCTPG 


R00 

O v v 


GAAAGOAAAA 


850 


CCCCTTGATT 

^« \_ X X VJii X X 


900 


TGTTTCTPGA 


950 


GGTGTTTATG 

VjvJ X VJ XXX Xi X \J 


1000 

X V \J V/ 


CAATCCAAAG 


1050 


GGATTCGTGC 

uun X X V VJ X VJ\« 


1100 


ATCGGACAAC 


1150 


GATTGCTCCT 


1200 


CTGGATTTGA 

v x \jUji xxx 


1250 


ACTAGCTGGA 


1300 


CAGCACGCTC 


1350 


TGACCAATTC 


1400 


ACGCTGAAAA 


1450 


GGGAGATATG 


1500 


TATTTATGCG 

x r» xxx * x x \j 


1550 


GCTCTAGCCC 


1600 


TGTGGCTGAA 


1650 


AAATTGATAT 


1700 


GCTCAGACCT 


1750 


TCAATTGGTT 


1800 


AGTTAGCCAT 


1850 


GAAAGCGCCC 


1900 


AGCAGATGAC 


1950 


AGGCCAATAA 


2000 


GATGAATTGG 


2050 


GTCCATCATC 


2100 


CGACCCACTA 


2150 


GTCAATGTCC 


2200 


TCACAAGATT 


2250 


CCAAGATTGC 


2300 


TTGACTCAGC 


2350 


AACTAGTGCT 


2400 


ATCCTATCCT 


2450 


ATGCAGGTTA 


2500 




2535 
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2) INFORMATION FOR SEQ ID NO: 1184 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 . 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 



15 



TGACGGGTGA CTTTTATGTG, ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

20 GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

25 GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

30 AAACCAATCC AAAGGATCTC TTG 623 



2) INFORMATION FOR SEQ ID NO: 1185 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Double 

40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 
4 5 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

50 TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

55 GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 4 50 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

60 CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 
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CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
AAACCAATCC AAAGGATCTC T 



600 
621 



2) INFORMATION FOR SEQ ID NO: 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 2 50 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

AAACCAATCC AAAGGATCTC TT 622 



2) INFORMATION FOR SEQ ID NO: 1187 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1187 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 
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GTTGAGAATG CTCGCTCAGG 
GGATGAAACC AAAACGGCTA 
ATCGCCCCTT GATTGATAAG 
CAGGTCTTTC TCGACCATTT 
CAAGGGTGTT TATGACATTG 
AAACCAATCC AAAGGATCTC 



TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 
TGGGGATGCG TCTCTTGCGT TCTTGGATTC 
GAACGAATCG TCCAACGTCA AGAAGTAGTG 
CTTTGAGCGT AGTGACTTGA CAGACAGTCT 
AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
TT 



400 
450 
500 
550 
600 
622 
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20 
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40 
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2) INFORMATION FOR SEQ ID NO: 1188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : , Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 

GGGTGACTTT TATGTAACGG GGCTATTGGA TTTCACGTTG GTTTGTGGGG 50 

AAATTCGCAA TCTCAAGGCT AGAGAAGTGG TGCTGGGTTA TGACTTGTCT 100 

GAGGAAGAAG AACAAATCCT CAGTCGTCAG ATGAATCTGG TGCTTTCTTA 150 

TGAGAAGGAA GGCTTTGAGG ACCTTCATTT ACTGGATCCA CGACTGGCAG 200 

CTGTGGAGCA AGCGGCAGCT AGTAAGCTCC TCCAGTATGT TCACCGGACC 250 

CAGATGCGGG AATTGAACCA CCTCAAACCA GTTATCCGCT ATGAAATCAA 300 

AGATTTCTTA CAGATGGACT ATGCGACCAA GGCTAGTCTG GATTTGGTTG 350 

AGAATGCCCG TTCAGGCAAG AAGCAAGGCA GTCTTTTCTG GCTTTTAGAT 4 00 

GAAACCAAGA CGGCTATGGG AATGCGTCTC TTGCGTTCTT GGATTCATCG 4 50 

TCCTTTGATT GATAAGGAGC GAATCGTCCA GCGTCAAGAG GTGGTGCAGG 500 

TCTTTCTTGA CCACTTCTTT GAGCGTAGTG ATTTAACGGA CAGTCTTAAG 550 

GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 599 



2) INFORMATION FOR SEQ ID NO: 1189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 9456 

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 1189 

GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

GTGGGGAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 200 
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TGGCAGCTGT GGAGCAAGCG 
CGGACTCAGA TGAGGGAATT 
AATCAAAGAT TTTTTGCAGA 
TGGTTGAGAA TGCCCGTTCA 
5 TTGGATGAAA CCAAAACAGC 
TCATCGCCCC CTGATTGATA 
TGCAGGTCTT TCTCGACCAT 
CTCAAGGGTG TTTATGACAT 
CAAAACCAAT CCAAAGGATC 



GCATCTAGTA AACTGCTTCA 
GAACCACCTC AAGCCTGTTA 
TGGATTATGC GACCAAGGCT 
GGCAAGAAGC AAGGTAGTCT 
TATGGGAATG CGTCTCTTGC 
AGGAACGAAT TGTCCAACGC 
TTCTTTGAGC GTAGTGATTT 
TGAGCGCTTG GCTAGTCGTG 
TCTT 



PCT/CAOO/01150 

GTATGTTCAT 250 

TCCGCTATGA 300 

AGTCTGGATT 350 

TTTTTGGCTT 400 

GGTCTTGGAT 450 

CAAGAAGTTG 500 

GACAGACAGT 550 

TTTCTTTTGG 600 
624 



2) INFORMATION FOR SEQ ID NO: 1190 

15 (i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus witis 
25 (B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGT CTTGGTCTGC 50 

30 GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGCTGCTGGG CGAGGAGCTG 200 

TCTCCTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 3 00 

35 TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 3 50 

ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 4 00 

GGACGAGACT AAGACGGCCA TGGGCGGCCG CATGCTGCGC TCTTGGATCC 4 50 

AGCGTCCGCT GATTGATGAA GCGCGAATTA GCCAGCGACA GAATGTCGTT 500 

GAGGTTTTTC TGGATCATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

40 CAAGGGGGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC 599 



2) INFORMATION FOR SEQ ID NO: 1191 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 

60 TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 
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GGGGAAATCC 
GCCAGAAGCT 
CCTATGTACA 
TCTCCTATGG 
GACCCAGATG 
TCAAGGACTT 
ACAGAAAATG 
GGACGAGACC 
AGCGTCCGTT 
GAGGTTTTTC 
CAAGGGAGTC 
AGACCAATCC 



GTAATTTGAA 
GAGGAGCAGG 
GACGGCCTTG 
AGCGTCAGGC 
AGGGAGCTCA 
TCTGCAAATG 
CTCGCTCGGG 
AAGACGGCCA 
GATTGATGAA 
TGGAACATTT 
TATGATATCG 
AAAGGATCTC 



GGCTAGGGAA GTGGTGCTGG GCTATGCTTT 
TCTTGGCTGG ACAGATGAAT CTTTTGCTGT 
GACGATGTCC AGCTGCTGGG CGAGGAACTG 
AGCGGGGAAA TTGCTAGAGT ATGTGCACCG 
GCCATTTGAA GAAGGCCCAG CATTATGAAA 
GATTATGCTA CCAAGGCGAG TCTGGATTTG 
TAAGAAACAC GGCAGTCTTT ATTGGCTGAT 
TGGGCGGCCG TATGCTGCGC TCTTGGATCC 
GTGCGAATTA GCCAGCGGCA GAATGTCGTC 
CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 
AGCGGCTGGC TAGTCGGGTG TCTTTTGGCA 
TT 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
622 



2) INFORMATION FOR SEQ ID NO: 1192 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192 
GGTAAAACAG GAACCTCTAA CT 



2) INFORMATION FOR SEQ ID NO: 1193 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 
GGTAAGACAG GTACTTCTAA CT 



2) INFORMATION FOR SEQ ID NO: 1194 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 
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2) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ , ID .NO : 1195 
CATTTCAAGT AACACAACTG AATC 



2) INFORMATION FOR SEQ ID NO: 1196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196 
GCCATTTCAA GTAATACAAC AGAA 



2) INFORMATION FOR SEQ ID NO: 1197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 



2) INFORMATION FOR SEQ ID NO: 1198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 

AACGGGCGTC TCGATAGAAA AACACGTGAA AATCCCAATG ATTATAAACA 50 

ATCAATATAC GATTTTGCTG AAGCTGTAAC AAAAGGTATT AAGGAACAAA 100 

CAAATAAAAA TTAATAGGCA ACTTAACCAG AATCGTTAAA ACTATATGAC 150 

GATTCTGGTT TTTTAAATTC AAAAAGTTTT CTAAAAAATT TACTTGCTTC 200 

TTTAAAGTAT AGGTATGAAA TACAATTGAT TAAAATAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAKGTA 3 50 

AACATCCTAT TGATTTTGTG GACGCCCGTT A 3 81 



2) INFORMATION FOR SEQ ID NO: 1199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATCCAA AAAGC 



2) INFORMATION FOR SEQ ID NO: 1200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200 
TCAAAGAAGA AACTAAAAAA GCTGT 



2) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AACGTAGGTG TCCTTCTTC 



2) INFORMATION FOR SEQ ID NO: 1202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2) INFORMATION FOR SEQ ID NO: 1203 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203 
GGIGARMGIG GIAAYGARAT G 



2) INFORMATION FOR SEQ ID NO: 1204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 04 
GCIAAYAACI TCIWMYATGC C 



2) INFORMATION FOR SEQ ID NO: 1205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 
AAYACITCIA WYATGCCIGT 



2) INFORMATION FOR SEQ ID NO: 1206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: , Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 06 
CKISRIGTIG ARTCIGCCA 



2) INFORMATION FOR SEQ ID NO: 1207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207 
CCITCITCWC CIGGCATYTC 



2) INFORMATION FOR SEQ ID NO; 1208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208 
TCAAAAAGTT TTCTAAAAAA TTTAC 



2) INFORMATION FOR SEQ ID NO: 1209 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209 
ACGGGCGTCC ACAAAATCAA TAGGA 



2) INFORMATION FOR SEQ ID NO: 1210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
ACCAGCTTGC CCAATACAAA GG 



2) INFORMATION FOR SEQ ID NO: 1211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 



2 ) INFORMATION FOR SEQ ID NO: 1212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 
CCICCIRGIG GIGAIACIGC WCC 
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2) INFORMATION FOR SEQ ID NO: 1213 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 

15 

AARGG I GG I A CIGCIGCIAT HCCIGG 26 



20 2) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 

GGTAAAACAG GTACCTCTAA CTA 23 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1215 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 133 7 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1215 



AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

55 ATGTCATTAG AACGTTGCGT CAAACTAGCA TCTAAATAAA GATCGAAATG 150 

CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 2 00 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTGCTAAGTT 2 50 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 300 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 350 

60 ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 4 00 
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AGTTCCAACG 
AAAGCTTTCA 
ACTGGACTAA 
TCCTGATAAC 
5 GTAAAGCAGC 
ATTCATCAGG 
TGTTTGCAAT 
GTCCAAATGC 
GTTGTTGATA 

10 AGGAATTGAG 
ATTTAAAACG 
TATTTTAAAA 
. AATGAAAGAC 
AATCTAATAC 

L5 AAACGTTGTT 
GTAATGGGAC 
AGGTGGCTAC 
GTTTGCCAAA 
AAGCATTGAG 

20 
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GTTTTTCAAA 
ACCTGATGCA 
CGCCAGAACG 
GAAGGGAATC 
TTATTTTTCA 
CTGGGCTTCC 
CATTTGATGT 
CAAAGCTGGG 
AACCTAATAC 
GCTGCTATTT 
TGTAGGGGGC 
AAACAGAGAT 
CTAGGGAAAC 
TACTTCTAAC 
GATCTAAATG 
TTATCAAGAT 
CACCTTTCCA 
TATTGGCATA 
ATAGGGACAC 



AATCTGCCGA 
GTCCTTTGTA 
CGTTGCCATT 
AGCCTATTGA 
ACCTTGCCAA 
TGCTTCTGTT 
ATCAAGCCCT 
TTTATGCATA 
AGCTGCCATG 
TTGCCATTGT 
GCTACTCACT 
ATGAACTAAC 
TTTTCATCGG 
ATCAGCGGGT 
AGCAAfCCCA 
TTAAGGCTTT 
GAATCAGGTT 
GAGATAACCA 
TTTCTATAGC 



TGTGCTCCAG 
TTGGGCAAGC 
AATCAAGACG 
TACACCTATT 
TCAAAGCGAT 
TCTAATACAG 
TTACTTAGTG 
TTCCCTTTAT 
AACCTCGATG 
CGATTTCAAA 
GACTGTGACG 
TCTGTTTTTT 
TCTTTCTCAA 
ATAGTTTGCC 
, TTCAAAACAT 
TAACAAAGGA 
GGAGTTTGAC 
TCTACATACT 
AACTAGT 



CAGCATATCG 
TGGTGGCCGG 
ATGCTCGCAT 
CGTGCAGATG 
GGTTGCTGCC 
CTGGTACCTT 
GATAAATATT 
GATGGAACAG 
ATATTACAAG 
GATCGTTCCG 
CTACTAAACC 
TTGTGCTAAA 
TTGTCATCTT 
AGTAATTAAG 
TAAGGTCAGG 
CTAATATCAT 
AATAGTATTG 
CTAATTCGTT 



450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1337 



2) INFORMATION FOR SEQ ID NO: 1216 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
35 GGTAAGACTG GTACATCAAA CTA 23 



50 



2) INFORMATION FOR SEQ ID NO: 1217 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217 

50 

CAAATGCCAT TTCAAGTAAC ACAAC 25 



55 2) INFORMATION FOR SEQ ID NO: 1218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 

CAAACGCCAT TTCAAGTAAC ACAAC 25 



10 



20 



25 



60 



2) INFORMATION FOR SEQ ID NO: 1219 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases. 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 
CAAATGCTAT TTCAAGTAAT ACAAC 25 

2) INFORMATION FOR SEQ ID NO: 1220 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220 
CAAACGCCAT TTCAAGTAAT ACGAC 25 

40 

2) INFORMATION FOR SEQ ID NO: 1221 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 
55 GAYACICCIG GICAYGTIGA YTT 23 



2) INFORMATION FOR SEQ ID NO: 1222 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
ATYGAYACIC CIGGICAYGT IGAYTT 



2) INFORMATION FOR SEQ ID NO: 1223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223 
AYITCIARRT GIARYTCRCC CATICC 



2) INFORMATION FOR SEQ ID NO: 1224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT IATG 



2) INFORMATION FOR SEQ ID NO: 1225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225 
TAICCRAACA TYTCISMIAR IGGIAC 
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2) INFORMATION FOR SEQ ID NO: 1226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 6 
GTIRMRTAIC CRAACATYTC 



2) INFORMATION FOR SEQ ID NO: 1227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227 
GTICCIYTIK CIGARATGTT YGGITA 



2) INFORMATION FOR SEQ ID NO: 1228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 
GTICCIYTIK CIGARATGTT YGGITAYGC 



2) INFORMATION FOR SEQ ID NO: 1229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 9 
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26 



5 2) INFORMATION FOR SEQ ID NO: 1230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2145 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(D) ACCESSION NUMBER: X00415 



20 



(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1230 



TGAACGCCTA AAAGATAAAC GAGGAAACAA ATGGCTCGTA CAACACCCAT 50 

CGCACGCTAC CGTAACATCG GTATCAGTGC GCACATCGAC GCCGGTAAAA 100 

CCACTACTAC CGAACGTATT CTGTTCTACA CCGGTGTAAA CCATAAAATC 150 

GGTGAAGTTC ATGACGGCGC TGCAACCATG GACTGGATGG AGCAGGAGCA 200 

25 GGAACGTGGT ATTACCATCA CTTCCGCTGC GACTACTGCA TTCTGGTCTG 250 

GTATGGCTAA GCAGTATGAG CCGCATCGCA TCAACATCAT CGACACCCCG 3 00 

GGGCACGTTG ACTTCACAAT CGAAGTAGAA CGTTCCATGC GTGTTCTCGA 350 

TGGTGCGGTA ATGGTTTACT GCGCAGTTGG TGGTGTTCAG CCGCAGTCTG 4 00 

AAACCGTATG GCGTCAGGCA AACAAATATA AAGTTCCGCG CATTGCGTTC 450 

30 GTTAACAAAA TGGACCGCAT GGGTGCGAAC TTCCTGAAAG TTGTTAACCA 500 

GATCAAAACC CGTCTGGGCG CGAACCCGGT TCCGCTGCAG CTGGCGATTG 550 

GTGCTGAAGA ACATTTCACC . GGTGTTGTTG ACCTGGTGAA AATGAAAGCT 600 

ATCAACTGGA ACGACGCTGA CCAGGGCGTA ACCTTCGAAT ACGAAGATAT 650 

CCCGGCAGAC ATGGTTGAAC TGGCTAACGA ATGGCACCAG AACCTGATCG 700 

35 AATCCGCAGC TGAAGCTTCT GAAGAGCTGA TGGAAAAATA CCTGGGTGGT 750 

GAAGAACTGA CTGAAGCAGA AATCAAAGGT GCTCTGCGTC AGCGCGTTCT 800 

GAACAACGAA ATCATCCTGG TAACCTGTGG TTCTGCGTTC AAGAACAAAG 850 

GTGTTCAGGC GATGCTGGAT GCGGTAATTG ATTACCTGCC ATCCCCGGTT 900 

GACGTACCTG CGATCAACGG TATCCTGGAC GACGGTAAAG ACACTCCGGC 950 

4 0 TGAACGTCAC GCAAGTGATG ACGAGCCGTT CTCTGCACTG GCGTTCAAAA 1000 

TCGCTACCGA CCCGTTTGTT GGTAACCTGA CCTTCTTCCG TGTTTACTCC 1050 

GGTGTGGTTA ACTCTGGTGA TACCGTACTG AACTCCGTGA AAGCTGCACG 1100 

TGAGCGTTTC GGTCGTATCG TTCAGATGCA CGCTAACAAA CGTGAAGAGA 1150 

TCAAAGAAGT TCGCGCGGGC GACATCGCTG CTGCTATCGG TCTGAAAGAC 1200 

45 GTAACCACTG GTGACACCCT GTGTGACCCG GATGCGCCGA TCATTCTGGA 1250 

ACGTATGGAA TTCCCTGAGC CGGTAATCTC CATCGCAGTT GAACCGAAAA 1300 

CCAAAGCTGA CCAGGAAAAA ATGGGTCTGG CTCTGGGCCG TCTGGCTAAA 1350 

GAAGACCCGT CTTTCCGTGT ATGGACTGAC GAAGAATCTA ACCAGACCAT 1400 

CATCGCGGGT ATGGGCGAAC TGCACCTCGA CATCATCGTT GACCGTATGA 1450 

50 AGCGTGAATT CAACGTTGAA GCGAACGTAG GTAAACCGCA GGTTGCTTAC 1500 

CGTGAAACTA TCCGCCAGAA AGTTACCGAT GTTGAAGGTA AACACGCGAA 1550 

ACAGTCTGGT GGTCGTGGTC AGTATGGTCA TGTTGTTATC GACATGTACC 1600 

CGCTGGAGCC GGGTTCAAAC CCGAAAGGCT ACGAGTTCAT CAACGACATT 1650 

AAAGGTGGTG TAATCCCTGG CGAATACATC CCGGCCGTTG ATAAAGGTAT 1700 

55 CCAGGAACAG CTGAAAGCAG GTCCGCTGGC AGGCTACCCG GTAGTAGACA 1750 

TGGGTATTCG TCTGCACTTC GGTTCTTACC ATGACGTTGA CTCCTCTGAA 1800 

CTGGCGTTTA AACTGGCTGC TTCTATCGCC TTTAAAGAAG GCTTTAAGAA 1850 

AGCGAAACCA GTTCTGCTTG AGCCGATCAT GAAGGTTGAA GTAGAAACTC 1900 

CGGAAGAGAA CACCGGTGAC GTTATCGGTG ACTTGAGCCG TCGTCGTGGT 1950 

60 ATGCTCAAAG GTCAGGAATC TGAAGTTACT GGCGTTAAGA TCCACGCTGA 2000 
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AGTACCGCTG TCTGAAATGT TCGGATACGC AACTCAGCTG CGTTCTCTGA 2050 
CCAAAGGTCG TGCATCATAC ACTATGGAAT TCCTGAAGTA TGATGAAGCG 2100 
CCGAGTAACG TTGCTCAGGC CGTAATTGAA GCCCGTGGTA AATAA 214 5 



2) INFORMATION FOR SEQ ID NO: 1231 

(i) SEQUENCE CHARACTERISTICS: 
L0 (A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 31 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 37 

20 

2) INFORMATION FOR SEQ ID NO: 1232 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
35 GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 42 



40 



2) INFORMATION FOR SEQ ID NO: 1233 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233 

50 

GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 35 



55 2) INFORMATION FOR SEQ ID NO: 1234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234 
ACTAAATAAA CGCTCATTCG 



2 ) INFORMATION FOR SEQ ID NO: 1235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235 
GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 



2) INFORMATION FOR SEQ ID NO: 1237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1238 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCGGCG TTAATTTTGG CACCGAAGAA GAGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 9 
GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 



2) INFORMATION FOR SEQ ID NO: 1240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 
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2) INFORMATION FOR SEQ ID NO: 1242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(E) STRAIN: BM4147-1 

(F) ACCESSION NUMBER: U3 9790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 



TTCTTAGAGA CATTGAATAT 
TGCATGTGCC ATGGATAAAA 
GTGTGCCGCA AGTTCCTTAT 
AATCCTAAAA AAGTATTTGA 
GTTTGTCAAA CCTGCGAATA 
AAAACCGAGA AGAGCTGCAA 
TCTCGAGCAA TCGTTGAACA 
TGTATTAGGA AATGAAGATG 
AAGACGTAGC ATTCTATGAT 
GAAATGCAGA TTCCAGCCGA 
AGAGTACGCG AAGTTAGCTT 
GGTGCGATTT CTTTTTGACA 



GCCTTATGTC GGCGCAGGCG 
TCATGACCAA GTATATTTTA 
GTACCAGTAC TTAAGAATCA 
TCAATGTGAA GGTTCTTTGC 
TGGGTTCTAG TGTCGGCATT 
AATGCTTTAG CAACAGCCTA 
AGGAATTGAA GCGCGCGAAA 
TTCGGACGAC TTTGCCTGGC 
TATGAAGCCA AATATATCAA 
AGTGCCGGAA GAAGTTTATC 
ACACGATGTT AGGTGGAAGC 
AATAAAAATG AATTATTCCT 



TATTGACCAG 50 

CAAGCTGCTG 100 

ATGGAAAGAA 150 

TTTATCCGAT 2 00 

ACAAAGGCAG 250 

TCAGTATGAT 3 00 

TCGAAGTTGC 350 

GAAGTCGTAA 4 00 

TAATAAAATC 4 50 

AAAAAGCGCA 500 

GGATTGAGCC 550 

GAATGAATTA 600 



2) INFORMATION FOR SEQ ID NO: 1243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2275 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 
(C) ACCESSION NUMBER: M38386 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 

GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 200 

ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 300 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 350 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 4 00 

TTTCAATTGA ATTAAATCAA CAGTTTAATA CAAAAGAAGA AGCTATTCGC 4 50 

TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 500 
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CGAAGCAATG ATTGAAAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 550 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCAAAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 650 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

5 AAGAACATTT GCAATTAGTC CAACAAATTG CACTTTATTG TAGTGATATG 750 

GATAACGTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 800 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 950 

10 TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 1000 

ACAAATTAAC GTTGAAAATG TGACCGGGTT AAATAACATG ACAGAACCAG 1050 

AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGCTGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 1200 

15 GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 125 0 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 1300 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 1350 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 1400 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 1450 

20 TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 1600 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 1700 

25 CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 1750 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 1800 

TAGCAATGAT TGGTATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 1850 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 1900 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AAACGGTTAA AAATATTAAA 1950 

30 CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2 000 

CCTTCTACCA AGATACTTCA CATTTCTTAA TTAAAGAAAA AACAACCGCG 2050 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 2100 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 2150 

TTCATGACTA CTTATATGGT CAAGGAGGTC TAAGGTTTCT CGCAAAGAAG 22 00 

35 TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 2250 

GAACTTGGTG AAGAAGGGGG GATCC 22 75 



40 2) INFORMATION FOR SEQ ID NO: 1244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 25923 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 



GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGCTCA 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGAAGCTGG TCATACGAGA GTTATATTTT 200 

60 CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 250 

668 
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ATTATCACAC AAATATTTTA GCTTTTCAGT 
CTTTATCCAT ACGTTGAATA ATTGTACGAT 
ACACCTTTAA TGTTATTTGT TTTAAAAGCA 
ACGATGTGAA TCTTCTAAGA AGTCACCGTA 



PCT/CAOO/01150 

TTCTATATCA ACTGTAGCTT 300 

TCTGACGCAC CATCTTTTGC 350 

TGAATAAGTT TTTCAACACA 4 00 

AAATGAAGGA TC 442 
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2) INFORMATION FOR SEQ ID NO: 1245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 4 00 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 4 50 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 7 00 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 

ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTC 845 



2) INFORMATION FOR SEQ ID NO: 1246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 11986 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246 

TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA ATGTCTTCCA 50 

670 
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CAGATGGACT 
TCTGTACCAG 
TGATGCAATT 
CAATTCACCG 
ATTCTTGAAA 
GGGTGGTAAG 
TAATTCAGGA 
GTATTCGCTG 
CGAAATGAGC 
AAATGAACGA 
ACAATGGCTG 
CATCGATAAC 
TTCTTG 



TGTTCGTGGC 
TTGGTGATGT 
GACTTAGATG 
TCAAGCGCCT 
CTGGTATTAA 
ATTGGTCTAT 
ATTAATTAAT 
GTGTAGGTGA 
GATTCTGGCG 
GCCACCTGGA 
AACATTTCCG 
ATCTTCCGTT 



ACAGAAGTAG 
AACACTTGGT 
GTGATGTTCC 
GCATTCGAAG 
AGTAGTAGAC 
TCGGTGGTGC 
AACATCGCAC 
GCGTACTCGT 
TAATTAAGAA 
GCACGTCAAC 
TGATGAGCAA 
TCACGCAAGC 



AAGATACTGG TAAAGCAATC 
CGTGTATTTA ACGTATTAGG 
TGCGGATGTA CGTCGTGATC 
AGTTATCTAC TAAAGTAGAA 
TTACTTGCTC CTTACATTAA 
CGGCGTAGGT AAAACAGTAT 
AAGAGCACGG TGGTATCTCT 
GAAGGTAACG ACTTATACCA 
AACTGCGATG GTATTCGGAC 
GTGTTGCATT AACAGGTTTA 
GGACAAGACG TACTATTGT'T 
GGGTTCTGAA GTATCTGCCC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
656 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1247 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247 

CGAAAACGGA AGTATTAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAACCA ATCTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGTCGCGTAT TTAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAGGT TCCTGCAGAT GTACATCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGCGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 450 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT A 791 



2) INFORMATION FOR SEQ ID NO: 1248 
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SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AC1 

(Xi) SEQUENCE DESCRIPTION: SEQ^ ID NO: 1248 



15 ATCTACAATG 
CTTAACATTT 
TTGCGATGTC 
ACTGGTAAAG 
ATTCAACGTA 

20 ATGTACACCG 
TCTACTAAAG 
TGCTCCTTAC 
TAGGTAAAAC 
CACGGTGGTA 

25 TAATGACTTA 
CGATGGTATT 
GCATTAACAG 
AGACGTACTT 
CTGAAGTATC 

30 CCAACACTTG 
AAATAAAGGG 



CCCTTACGGT 
GAAGTTGCAC 
TTCCACAGAT 
CAATCTCTGT 
TTAGGTGATG 
TGATCCAATT 
TAGAAATTCT 
ATTAAGGGTG 
AGTATTAATT 
TCTCTGTATT 
TACCACGAAA 
CGGACAAATG 
GTTTAACAAT 
CTGTTCATCG 
TGCCCTTCTT 
CAACAGAAAT 
TCTATCACGT 



AAAACAAAGC 
TTCATTTAGG 
GGACTTGTTC 
ACCAGTTGGT 
CAATTGACTT 
CACCGTCAAG 
TGAAACTGGT 
GTAAGATCGG 
CAGGAGTTAA 
CGCTGGTGTA 
TGAGCGATTC 
AACGAGCCAC 
GGCTGAGCAT 
ATAACATCTT 
GGTCGTATGC 
GGGTCAATTA 
CTATC 



AACGAAAACG 
TGATGATACA 
GTGGCACAGA 
GATGCAACAC 
AGATGGTGAA 
CACCTGCATT 
ATTAAAGTAG 
CCTATTCGGT 
TCAATAACAT 
GGTGAGCGTA 
TGGCGTAATC 
CTGGAGCACG 
TTCCGTGATG 
CCGTTTCACG 
CATCTGCGGT 
CAAGAGCGTA 



GAAGCATGAA 
GTTCGTACAG 
AGTAGAAGAT 
TTGGACGTGT 
CTTCCTGCGG 
CGAAGAATTA 
TAGACTTACT 
GGTGCCGGCG 
CGCACAAGAG 
CTCGTGAGGG 
AAGAAAACTG 
TCAACGTGTT 
AGCAAGGACA 
CAAGCGGGTT 
AGGTTACCAA 
TTACATCTAC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
150 
800 
825 



35 2) INFORMATION FOR SEQ ID NO: 1249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGTATTAA 50 
CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 
TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 
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ACTGGTAAAC CAATCTCTGT ACCAGTTGGT GATGTAACAC TTGGTCGCGT 
ATTTAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAG GTTCCTGCAG 
ATGTACATCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 
TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 
TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 
TAGGTAAAAC AGTATTAATT CAGGAATTAA TTAACAACAT CGCACAAGAG 
CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 
TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 
CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 
GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA 
AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 
CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 
CCAACACTTG CAACAGAAAT GGGTC 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
775 



2 ) INFORMATION FOR SEQ ID NO: 1250 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250 

CCAGCAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAGC 50 

GAGCATCAAC TTAACATTTG AAGTTGCACT TCATTTAGGT GATGACACAG 100 

TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG TGGCACAGAA 150 

GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG ATGTAACACT 200 

TGGTCGCGTA TTCAACGTAT TAGGTGATGC AATTGACTTA GATGGTGATG 250 

TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC ACCTGCATTC 300 

GAAGAACTAT CTACAAGAAT AGAAATTCTT GAAACTGGTA TTAAAGTAGT 350 

AGATTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACG GTATTAATTC AGGAATTAAT TAACAACATC 450 

GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG GTGAGCGTAC 500 

TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT GGCGTAATTA 550 

AGAAAACTGC GATGGTATTT GGACAAATGA ACGAGCCACC TGGAGCACGT 600 

CAACGTGTTG CATTAACAGG TTTAACAATG GCTGAACATT TCCGTGATGA 650 

GCAAGGACAA GACGTACTAT TGTTCATCGA TAACATCTTC CGTTTCACGC 700 

AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GTCGTATGCC ATCTGCGGTA 750 

GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC AAGAGCGTAT 800 

TACATCTACA AATAAAGGGT CTATCACGTC TA 832 



2) INFORMATION FOR SEQ ID NO: 1251 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 1251 



AAATCTACAA CGCCCTTACG GTAAAACAAA 
AACTTAACAT TTGAAGTTGC ACTTCATTTA 
AGTTGCGATG TCTTCCACAG ATGGACTTGT 
ATACTGGTAA ACCAATCTCT GTACCAGTTG 
GTATTTAACG TATTAGGTGA TGCAATTGAC 
AGATGTACAT CGTGATCCAA TTCACCGTCA 
TATCTACTAA AGTAGAAATT CTTGAAACTG 
CTTGCTCCTT ACATTAAGGG TGGTAAGATC 
CGTAGGTAAA ACAGTATTAA TTCAGGAATT 
AGCACGGTGG TATCTCTGTA TTCGCTGGTG 
GGTAATGACT TATACCACGA AATGAGCGAT 
TGCGATGGTA TTCGGACAAA TGAACGAGCC 
TTGCATTAAC AGGTTTAACA ATGGCTGAGC 
CAAGACGTAC TTCTGTTCAT CGATAACATC 
TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT 
AACCAACACT TGCAACAGAA ATGGGTCAAT 
AC 



GCAACGAAAA CGGAAGTATT 50 

GGTGATGATA CAGTTCGTAC 100 

TCGTGGCACA GAAGTAGAAG 150 

GTGATGTAAC ACTTGGTCGC 200 

TTAGATGGTG AGGTTCCTGC 250 

AGCACCTGCA TTCGAAGAAT 300 

GTATTAAAGT AGTAGACTTA 350 

GGCCTATTCG GTGGTGCCGG 4 00 

AATTAACAAC ATCGCACAAG 450 

TAGGTGAGCG TACTCGTGAG 500 

TCTGGCGTAA TCAAGAAAAC 550 

ACCTGGAGCA CGTCAACGTG 600 

ATTTCCGTGA TGAGCAAGGA 650 

TTCCGTTTCA CGCAAGCGGG 700 

GCCATCTGCG GTAGGTTACC 750 

TACAAGAGCG TATTACATCT 800 

802 



2) INFORMATION FOR SEQ ID NO: 1252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1410 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 
AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 
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AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA GAAGTAGAAG 150 

ATACTGGTAA ACCAATCTCT GTACCAGTTG GTGATGTAAC ACTTGGTCGC 200 

GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG AGGTTCCTGC 250 

AGATGTACAT CGTGATCCAA TTCACCGTCA AGCACCTGCA TTCGAAGAAT 300 

5 TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT AGTAGACTTA 350 

CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG GTGGTGCCGG 400 

CGTAGGTAAA ACAGTATTAA TTCAGGAATT AATTAACAAC ATCGCACAAG 4 50 

AGCACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG TACTCGTGAG 500 

GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA TCAAGAAAAC 550 

10 TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA CGTCAACGTG 600 

TTGCATTAAC AGGCTTAACA ATGGCTGAGC ATTTCCGTGA TGAGCAAGGA 650 

CAAGACGTAC TTCTGTTCAT CGATAACATC TTCCGTTTCA CGCAAGCGGG 700 

TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT GCCATCTGCG GTAGGTTACC 750 

AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG TATTACATCT 800 

15 ACAAATAAAG GGTCTATCAC GTC 823 



20 



2) INFORMATION FOR SEQ ID NO: 1253 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1253 

35 CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

40 CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATCAA TAACATCGCA 4 50 

CAAGAGCATG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

45 TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

50 TACCAACCGA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATT 7 98 
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2) INFORMATION FOR SEQ ID NO: 1254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254 

CATTTGAAGT TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA 50 

ATGTCTTCCA CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG 100 

TAAAGCAATC TCTGTACCAG TTGGTGATGC AACACTTGGT CGTGTATTTA 150 

ACGTATTAGG TGATGCAATT GACTTAGATG GTGAGGTTCC TGCGGATGTA 200 

CGTCGTGATC CAATTCACCG TCAAGCACCT GCATTCGAAG AATTATCTAC 250 

TAAAGTAGAA ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC 300 

CTTACATTAA GGGTGGTAAG ATCGGTCTAT TCGGTGGTGC CGGTGTAGGT 350 

AAAACGGTAT TAATTCAGGA ATTAATCAAT AACATCGCAC AAGAACACGG 4 00 

TGGTATCTCT GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAGGGTAATG 450 

ACTTATACCA CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG 500 

GTATTCGGAC AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCGTT 550 

AACAGGTTTA ACAATGGCTG AGCATTTCCG TGATGAGCAA GGACAAGACG 600 

TACTTCTGTT CATCGATAAT ATCTTCCGTT TCACGCAAGC AGGTTCTGAA 650 

GTATCTGCCC TTCTTGGCCG TATGCCATCT GCGGTAGGTT ACCAACCAAC 700 

ACTTGCAACA GAAATGGGTC AATTACAAGA GCGTATTACA TCTACAAATA 750 

AAGGGTCTAT CACGTCT 7 67 



2) INFORMATION FOR SEQ ID NO: 1255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: C-14 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255 

GAAATGCGTG AATCATTTTT AGATTATGCG ATGAGTGTTA TCGTTGCTCG 50 
TGCATTGCCA GATGTTCGTG ACGGTTTAAA ACCAGTACAT CGTCGTATAC 100 
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TATATGGATT 
TCAGCACGTA 
CTTATCTATT 
GTTATCCGCT 
5 GGCGCAGCAG 
TGAACTGTTA 
ATGATGGTAA 
TTATTAGCCA 
TCCACCACAT 

10 AGAACCCTGA 
GATTTCCCAA 
ATATGAAACA 
AAGAACGTGG 
CAAGTGAATA 

15 CAAGAAAATT 
CTGGTGTGCG 
ATTTTAAATA 
GAATATGATT 
AAGCGTTGGT 

20 ACGCAATACA 
ATTACGTATC 
AGTCAGATAC 



AAATGAACAA 
TCGTTGGTGA 
TATGAAGCAA 
TGTTGATGGC 
CAATGCGTTA 
CGTGATATTA 
TGAAAGAGAG 
ATGGTGCATC 
AACTTAACAG 
TATTTCAATT 
CTGCTGGACT 
GGTCGTGGTT 
AGNCGGACGT 
AGGCTCGTAT 
GACGGTATCA 
TGTCGTTATT 
ACTTATACAA 
GCACTTGTAA 
ACATTATTTA 
ACTTACGTAA 
GCACTTGACC 
AGATAAAGTT 



GGTATGACAC 
CGTAATGGGT 
TGGTACGTAT 
CAAGGTAACT 
TACTGAAGCG 
ATAAAGATAC 
CCGTCAGTCT 
AGGTATCGCG 
AATTAATCAA 
GCTGAGTTAA 
TATTTTAGGT 
CAATTCAAAT 
CAACGTATTG 
GATTGAAAAA 
CTGATTTACG 
GATGTGCGTA 
ACAAACACCT 
ATGGTAGACC 
GAGCATCAAA 
AGCTAAAGAT 
ATATCGATGA 
GCAA 



CGGATAAATC 
AAATATCACC 
GGCTCAAGAT 
TTGGTTCAAT 
CGTATGACTA 
AATAGATTTT 
TACCTGCTCG 
GTAGGTATGG 
TGGTGTACTT 
TGGAGGATAT 
AAGAGTGGTA 
GCGTTCTCGT 
TTGTCACTGA 
ATTGCAGAGC 
TGATGAAACA 
AG GAT GC AAA 
CTTCAAACAT 
GAAGCTTATT 
AGACAGTTGT 
CGTGCCCACA 
AATTATTTCA 



ATATAAAAAA 150 

CTCATGGTGA 200 

TTCAGTTATC 250 

GGATGGAGAT 300 

AAATCACACT 350 

ATCGATAACT 400 

ATTCCCTAAC 450 

CAACGAATAT 500 

AGCTTAAGTA 550 

TGAAGGTCCT 600 

TTAGACGTGC 650 

GCAGTTATTG 700 

AATTCCTTTC 750 

TCGTTCGTGA 800 

AGTTTACGTA 850 

TGCTAGTGTC 900 

CATTTGGTGT 950 

AATTTAAAAG 1000 

TAGAAGACGT 1050 

TTTTAGAAGG 1100 

ACGATTCGTG 1150 
1174 



25 

2) INFORMATION FOR SEQ ID NO: 1256 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10209 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

45 GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGTAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGA GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCGCCTGC ATTCGAAGAG 300 

TTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 400 

50 GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCACG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGAGC ACGTCAACGT 600 
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GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 

GCAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 

GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 
CAGCCAACAC TTGCAACAGA AATGGGTCAA 



650 
700 
750 
780 



2) INFORMATION FOR SEQ ID NO: 1257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 7700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257 

CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA ACGAAAACGG 50 

AACAAGCATT AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGACA 100 

CAGTTCGTAC AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

ACTTGGTCGT GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AGGTTCCTGC GGATGTACGT CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGTCTATTCG 4 00 

GTGGTGCCGG TGTAGGTAAA ACGGTATTAA TTCAGGAATT AATCAATAAC 4 50 

ATCGCACAAG AACACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 500 

TACTCGTGAG GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA 550 

TTAAGAAAAC TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA 600 

CGTCAACGTG TTGCGTTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 650 

TGAGCAAGGA CAAGATGTAC TTCTGTTCAT CGATAATATC TTCCGTTTCA 700 

CGCAAGCAGG TTCTGAAGTA TCTGCCCTTC TTGGCCGTAT GCCATCTGCG 7 50 

GTAGGTTACC AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG 800 

TATTACATCT ACAAATA 817 



2) INFORMATION FOR SEQ ID NO: 1258 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

.(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1404 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258 



CCAGAAATCT ACAACGCCCT TACGGTAAAA 

TATTAACTTA ACATTTGAAG TTGCACTTCA 

GTACAGTTGC GATGTCTTCC ACAGATGGAC 

10 GAAGATACTG GTAAACCAAT CTCTGTACCA 

TCGCGTATTT AACGTATTAG GTGATGCAAT 

CTGCAGATGT ACATCGTGAT CCAATTCACC 

GAATTATCTA CTAAAGTAGA AATTCTTGAA 

CTTACTTGCT CCTTACATTA AGGGTGGTAA 

15 CCGGCGTAGG TAAAACAGTA TTAATTCAGG 

CAAGAGCACG GTGGTATCTC TGTATTCGCT 

TGAGGGTAAT GACTTATACC ACGAAATGAG 

AAACTGCGAT GGTATTCGGA CAAATGAACG 

CGTGTTGCAT TAACAGGTTT AACAATGGCT 

20 AGGACAAGAC GTACTTCTGT TCATCGATAA 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC 

TACCAACCAA CACTTGCAAC AGAAATGGGT 

ATCTACAAAT AAAGGGTCTA TCACGTCTA 



CAAAGCAACG AAAACGGAAG 50 

TTTAGGTGAT GATACAGTTC 100 

TTGTTCGTGG CACAGAAGTA 150 

GTTGGTGATG TAACACTTGG 200 

TGACTTAGAT GGTGAGGTTC 250 

GTCAAGCACC TGCATTCGAA 300 

ACTGGTATTA AAGTAGTAGA 350 

GATCGGCCTA TTCGGTGGTG 400 

AATTAATTAA CAACATCGCA 450 

GGTGTAGGTG AGCGTACTCG 500 

CGATTCTGGC GTAATCAAGA 550 

AGCCACCTGG AGCACGTCAA 600 

GAGCATTTCC GTGATGAGCA 650 

CATCTTCCGT TTCACGCAAG 700 

GTATGCCATC TGCGGTAGGT 750 

CAATTACAAG AGCGTATTAC 800 

829 
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2) INFORMATION FOR SEQ ID NO: 1259 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
40 (C) ACCESSION NUMBER: 



(xi) SEQUENCE DESCRIPTION: SEQ 

TGGCGGAAAG CTACCAGAAA TCTACAACGC 

4 5 ACGAAAACGG TGAACTTAAC TTAACATTTG 

GATGATACAG TTCGTACAGT TGCGATGTCT 

TGGCACAGAA GTAGAAGATA CTGGTAAAGC 

ATGCAACACT TGGTCGCGTA TTTAACGTAT 

GATGGTGAGG TTCCTGCGGA TGTACGTCGT 

50 ACCTGCATTC GAAGAATTAT CTACTAAAGT 

TTAAAGTAGT AGACTTACTT GCTCCTTACA 

CTATTCGGTG GTGCCGGTGT AGGTAAAACA 

CAACAACATC GCACAAGAGC ACGGTGGTAT 

67 



ID NO: 1259 

CCTTACGGTA AAACAGAGCA 50 

AAGTTGCACT TCATTTAGGT 100 

TCCACAGATG GACTTGTTCG 150 

AATCTCTGTA CCAGTTGGTG 200 

TAGGTGATGC TATTGACTTA 250 

GATCCAATTC ACCGTCAAGC 300 

AGAAATTCTT GAAACTGGTA 350 

TTAAGGGTGG TAAGATCGGC 400 

GTATTAATTC AGGAGTTAAT 4 50 

CTCTGTATTC GCTGGTGTAG 500 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GTGAGCGTAC TCGTGAGGGT 
GGCGTAATTA AGAAAACTGC 
TGGAGCACGT CAACGTGTTG 
TCCGTGATGA GCAAGGACAA 
CGTTTCACGC AAGCAGGTTC 
ATCTGCGGTA GGTTACCAAC 
AAGAGCGTAT TACATCTACA 



AATGACTTAT ACCACGAAAT GAGCGATTCT 
GATGGTATTC GGACAAATGA ACGAGCCACC 
CATTAACAGG CTTAACAATG GCTGAATATT 
GACGTACTTC TGTTCATCGA TAATATCTTC 
TGAAGTATCT GCCCTTCTTG GCCGTATGCC 
CAACACTTGC AACAGAAATG GGTCAATTAC 
AATAAAGGGT CTATCACGTC TATC 



550 
600 
650 
700 
750 
800 
844 



2) INFORMATION FOR SEQ ID NO: 1260 

(i) SEQUENCE CHARACTERISTICS: t 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1260 

AAGCTACCAG AAATCTACAA CGCCCTTACG GTAAAACAGA GCAACGAAAA 50 

CGGAACAAGC ATTAACTTAA CATTTGAAGT TGCACTTCAT TTAGGTGATG 100 

ACACAGTTCG TACAGTTGCA ATGTCTTCCA CAGATGGACT TGTTCGTGGC 150 

ACAGAAGTAG AAGATACTGG TAAAGCAATC TCTGTACCAG TTGGTGATGC 200 

AACACTTGGT CGTGTATTTA ACGTATTAGG TGATGCAATT GACTTAGATG 250 

GTGAGGTTCC TGCGGATGTA CGCCGTGATC CAATTCACCG TCAAGCACCT 300 

GCATTCGAAG AATTATCTAC TAAAGTAGAA ATTCTTGAAA CTGGTATTAA 350 

AGTAGTAGAC TTACTTGCTC CTTACATTAA GGGTGGTAAG ATCGGTCTAT 400 

TCGGTGGTGC CGGTGTAGGT AAAACAGTAT TAATTCAGGA ATTAATCAAC 4 50 

AACATCGCAC AAGAACACGG TGGTATCTCT GTATTCGCTG GTGTAGGTGA 500 

GCGTACTCGT GAGGGTAATG ACTTATACCA CGAAATGAGC GATTCAGGCG 550 

TAATTAAGAA AACTGCGATG GTATTCGGAC AAATGAACGA GCCACCTGGA 600 

GCGCGTCAAC GTGTTGCGTT AACAGGTTTA ACAATGGCTG AGCATTTCCG 650 

TGATGAGCAA GGACAAGACG TTCTTCTGTT CATCGATAAT ATCTTCCGTT 7 00 

TCACGCAAGC AGGTTCTGAA GTATCTGCCC TTCTTGGTCG TATGCCATCT 750 

GCGGTAGGTT ACCAACCAAC ACTTGCAACA GAAATGGGTC AATTACAAGA 800 

GCGTATTACA TCTACAAATA AAGGGTCTAT CACGTCTATC 84 0 



2) INFORMATION FOR SEQ ID NO: 1261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AZ1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261 

10 GCGGAAAGCT ACCAGAAATC TACAATGCCC TTACGGTAAA ACAAAGCAAC 50 

GAAAACGGAA GCATGAACTT AACATTTGAA GTTGCACTTC ATTTAGGTGA 100 

TGATACAGTT CGTACAGTTG CGATGTCTTC CACAGATGGA CTTGTTCGTG 150 

GCACAGAAGT AGAAGATACT GGTAAAGCAA TCTCTGTACC AGTTGGTGAT 200 

GCAACACTTG GACGTGTATT CAACGTATTA GGTGATGCAA TTGACTTAGA 250 

15 TGGTGAACTT CCTGCGGATG TACACCGTGA TCCAATTCAC CGTCAAGCAC 300 

CTGCATTCGA AGAATTATCT ACTAAAGTAG AAATTCTTGA AACTGGTATT 350 

AAAGTAGTAG ACTTACTTGC TCCTTACATT AAGGGTGGTA AGATCGGCCT 4 00 

ATTCGGTGGT GCCGGCGTAG GTAAAACAGT ATTAATTCAG GAGTTAATCA 4 50 

ATAACATCGC ACAAGAGCAC GGTGGTATCT CTGTATTCGC TGGTGTAGGT 500 

20 GAGCGTACTC GTGAGGGTAA TGACTTATAC CACGAAATGA GCGATTCTGG 550 

CGTAATCAAG AAAACTGCGA TGGTATTCGG ACAAATGAAC GAGCCACCTG 600 

GAGCACGTCA ACGTGTTGCA TTAACAGGTT TAACAATGGC TGAGCATTTC 650 

CGTGATGAGC AAGGACAAGA CGTACTTCTG TTCATCGATA ACATCTTCCG 700 

TTTCACGCAA GCGGGTTCTG AAGTATCTGC CCTTCTTGGT CGTATGCCAT 750 

25 CTGCGGTAGG TTACCAACCA ACACTTGCAA CAGAAATGGG TCAATTACAA 800 

GAGCGTATTA CATCTACAAA TAAAGGGTCT ATCACGTCT 839 



30 2) INFORMATION FOR SEQ ID NO: 1262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4H2 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1262 



AAGCTACCAG AAATCTACAA TGCCCTTACG GTAAAACAAA GCAACGAAAA 50 

CGGAAGCATG AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA 100 

CAGTTCGTAC AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

50 ACTTGGACGT GTATTCAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AACTTCCTGC GGATGTACAC CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG 4 00 
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GTGGTGCCGG CGTAGGTAAA ACAGTATTAA 
ATCGCACAAG AGCACGGTGG TATCTCTGTA 
TACTCGTGAG GGTAATGACT TATACCACGA 
TCAAGAAAAC TGCGATGGTA TTCGGACAAA 
5 CGTCAACGTG TTGCATTAAC AGGTTTAACA 
TGAGCAAGGA CAAGACGTAC TTCTGTTCAT 
CGCAAGCGGG TTCTGAAGTA TCTGCCCTTC 
GTAGGTTACC AACCAACACT TGCAACAGAA 
TATTACATCT ACAAATAAAG GGTCTATCAC 



PCT/CAOO/01150 

TTCAGGAATT AATCAATAAC 4 50 

TTCGCTGGTG TAGGTGAGCG 500 

AATGAGCGAT TCTGGCGTAA 550 

TGAACGAGCC ACCTGGAGCA 600 

ATGGCTGAGC ATTTCCGTGA 650 

CGATAACATC TTCCGTTTCA 700 

TTGGTCGTAT GCCATCTGCG 750 

ATGGGTCAAT TACAAGAGCG 800 

GTC 833 



2) INFORMATION FOR SEQ ID NO: 1263 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
25 (B) STRAIN: BGSC 4Q1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263 



CGAAAACGGA AGCATGAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

30 ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAAGCA ATTTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGACGTGTAT TCAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAACT TCCTGCGGAT GTACACCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

35 TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGTGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 4 50 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

4 0 GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT 7 90 

45 



2) INFORMATION FOR SEQ ID NO: 1264 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

682 
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25 



40 



WO 01/23604 PCT/CAOO/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264 



AGTTGCACTT CATTTAGGTG ATGATACAGT TCGTACAGTT GCGATGTCTT . 50 

CCACAGATGG ACTTGTTCGT GGCACAGAAG TAGAAGATAC TGGTAAACCA 100 

ATCTCTGTAC CAGTTGGTGA TGTAACACTT GGTCGCGTAT TTAACGTATT 150 

AGGTGATGCA ATTGACTTAG ATGGTGAGGT TCCTGCAGAT GTACATCGTG 200 

15 ATCCAATTCA CCGTCAAGCA CCTGCATTCG AAGAATTATC TACTAAAGTA 250 

GAAATTCTTG AAACTGGTAT TAAAGTAGTA GACTTACTTG CTCCTTACAT 300 

TAAGGGTGGT AAGATCGGCC TATTCGGTGG TGCCGGCGTA GGTAAAACAG 350 

TATTAATTCA GGAATTAATT AACAACATCG CACAAGAGCA CGGTGGTATC 400 

TCTGTATTCG CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ATGACTTATA 4 50 

20 CCACGAAATG AGCGATTCTG GCGTAATCAA GAAAACTGCG ATGGTATTCG 500 

GACAAATGAA CGAGCCACCT GGAGCACGTC AACGTGTTGC ATTAACAGGT 550 

TTAACAATGG CTGAGCATTT CCGTGATGAG CAAGGACAAG ACGTACTTCT 600 

GTTCATCGAT AACATCTTCC GTTTCACGCA AGCGGGTTCT GAAG 64 4 



2) INFORMATION FOR SEQ ID NO: 12 65 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265 



GGCGGAAAGC TACCAGAAAT CTACAACGCC CTTACGGTAA AACAGAGCAA 50 

CGAAAACGGA ACAAGCATTA ACTTAACATT TGAAGTTGCA CTTCATTTAG 100 

45 GTGATGACAC AGTTCGTACA GTTGCAATGT CTTCCACAGA TGGACTTGTT 150 

CGTGGCACAG AAGTAGAAGA TACTGGTAAA GCAATCTCTG TACCAGTTGG 200 

TGATGCAACA CTTGGTCGTG TATTTAACGT ATTAGGTGAT GCAATTGACT 250 

TAGATGGTGA GGTTCCTGCG GATGTACGTC GTGATCCAAT TCACCGTCAA 300 

GCACCTGCAT TCGAAGAATT ATCTACTAAA GTAGAAATTC TTGAAACTGG 350 

50 TATTAAAGTA GTAGACTTAC TTGCTCCTTA CATTAAGGGT GGTAAGATCG 400 

GTCTATTCGG TGGTGCCGGT GTAGGTAAAA CGGTATTAAT TCAGGAATTA 4 50 

ATCAATAACA TCGCACAAGA ACACGGTGGT ATCTCTGTAT TCGCTGGTGT 500 

AGGTGAGCGT ACTCGTGAGG GTAATGACTT ATACCACGAA ATGAGCGATT 550 
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CTGGCGTAAT TAAGAAAACT GCGATGGTAT 

CCTGGAGCAC GTCAACGTGT TGCGTTAACA 

TTTCCGTGAT GAGCAAGGAC AAGATGTACT 

TCCGTTTCAC GCAAGCAGGT TCTGAAGTAT 

5 CCATCTGCGG TAGGTTACCA ACCAACACTT 

ACAAGAGCGT ATTACATCTA CAA 
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TCGGACAAAT GAACGAGCCA 600 
GGTTTAACAA TGGCTGAGCA 650 
TCTGTTCATC GATAATATCT 700 
CTGCCCTTCT TGGCCGTATG 750 
GCAACAGAAA TGGGTCAATT 800 

823 



10 2) INFORMATION FOR SEQ ID NO: 1266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266 



TGGTCCGAGR CCCGATTCMA TGAAATTATC AAGGAAACCT CCAAYTTCAT 50 

TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT CCCATTTCTG 100 

GTTTCCAGGG TGACAACATG ATCGATSCCT CTGCCAACTG CCCATGGTAC 150 

AAGGGCTGGT ACMAKGAGAC TGCCGACAGG CAAGYACTCT GGCAAGACCC 200 

3 0 TTCTTGAGGC CATTGACGSC ATTGAGCCCC CCAMSCGTCC TWCCGATAAA 250 

CCTCTCCGTC TTCCTCTCCA GGATGTCTAC AAGATCTCCG GTATTGGMAC 300 

TGTTCCTGTC GGACGTRTTG AGACTGGAGT CATCAAGCCC GGTATGGTCG 350 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAAATG 400 

CACCACCAGC AGCTTTCCGA CGGTAWCCCC GGTGACAACG TCGGCTTCAA 450 

35 CGTCAAGAAT GTTTCCGTCA AAGAAGTCCG CCGTGGTAAC GTTGCCTGGT 500 

GACTCTAAGA ATGATCCCGC MAWGGGCTGC GATTCCTTCA ATGCYCAGGT 550 

CATCGTCCTC AACCACCCTG GTCAGGTTGG CGCTGGTTAT GCCCCAGTCC 600 

TCGAYTGCCA TACTGCCCAC ATTGCYTGCA ARTTCGCTGA GMTCMAKGAG 650 

AAGATTGAYC GCCGAACCGG MAAGTCTGTT GAGAACGCCC CCAAGTTCAT 700 

4 0 CAAGTCCGGT GATGC 715 



45 



2) INFORMATION FOR SEQ ID NO: 1267 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 56220 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267 

GAGTCCTCTT ATTTACTTTT GTCATGACTA CCTTACTAAT CTGTCATAGA 50 

TCGTTACAAC GAAATCGTCA AGGAGACTTC CAACTTCATC AAGAAGGTCG 100 

GATACAACCC CAAGAACGTT CCTTTCGTTC CTATCTCCGG TTTCAACGGC 150 

10 GACAACATGC TTGAGCCCTC CCCCAACTGC CCCTGGTACA AGGGTTGGGA 200 

GAAGGAGACC AAGGCCGGTA AGGTCACTGG TAAGACCCTC CTCGAGGCCA 250 

TCGACGCCAT TGAGCCCCCT ACCCGTCCCG CCAACAAGGT CAGTACTACC 300 

TCAATTACTT GAACTCTCTT CATACGTTCC GATTACTGAC TGCTTCACAG 350 

CCCCTCCGTC TTCCCCTCCA GGACGTTTAC AAGATCGGTG GTATTGGAAC 400 

15 GGTGCCCGTC GGTCGTGTTG AGACCGGTAC CATCTCCCCT GGTATGGTCG 450 

TTACCTTGTA TGTATCCTGA CCATCCCCCT TGGCAATCAT TACGTACTAA 500 

CTCACTCTTC AGCGCTCCCG CCAACGTCAC CACTGAAGTC AAGAGTGTTG 550 

AAATGCACCA CCAGCAGCTC GCTGCCGGTC AGCCCGGTGA CAACGTTGGT 600 

TTCAACGTGA AGAACGTCTC CGTCAAGGAA ATCCGTCGTG GTAACGTTGC 650 

20 TGGTGATAGC AAGAACGACC CCCCTGCCGG TGCTGCTTCC TTCAACGCCC 700 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGTGCTGG TTACGCCCCA 750 

GTCCTTGACT GCCACACTGC CCACATTGCT TGCAAGTTCT CTGAACTCCT 800 

TGAGAAGATT GACCGTCGTA CCGGAAAGTC TGTTGAGGAC CACCCCAAGT 850 

TCATCAAGTC CGGTGACGCT GCCAT 875 

25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268 



GTGAGCGTGG 
45 AAGTACAGTG 
ATTTCAACTT 
GCCATCGTGA 
• TGCGCTATCC 
CTCCAAGGAT 
50 GTGTGAGGCA 
TCCGAGTCCC 
GAAGGTCGGA 
TCGAGGGTGA 



TATCACCATC 
TCACTGTCAT 
CCAGAGTATC 
CTTCATCAAG 
TCATCATTGC 
GGCCAGACTC 
ACTCATCGTT 
GTTTCAACGA 
TATAACCCCA 
CAACATGATT 



GATATTGCCC 
TGGTGAGTGC 
TACTCTAACA 
AACATGATCA 
TGCCGGTACT 
GTGAGCACGC 
GCCATCAACA 
AATCATCAAG 
AGGCTGTTCC 
GAACCCTCCC 



TCTGGAAATT 
TTTTTACCCC 
TATCCGCTTA 
CTGGTACCTC 
GGTGAGTTCG 
TCTGCTTGCT 
AGATGGACAC 
GAGGTTTCCA 
CTTCGTGCCA 
CCAACTGCAC 



CGAGACCCCG 50 

TCTTAAGCAG 100 

GATGCTCCCG 150 

CCAGGCTGAC 200 

AGGCTGGTAT 250 

TTCACCCTTG 300 

CACCAAGTGG 350 

ACTTCATCAA 400 

ATCTCTGGTT 450 

CTGGTACAAG 500 
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GGCTGGAACA 
CGATGCCATT 
TCCGTCTTCC 
CCCGTCGGAC 
5 TTTCGCTCCC 
ACCAACAACT 
AAGAACGTTT 
CAAAAATGAT 
TCCTTAACCA 
10 TGCCACACTG 
CGACCGCCGT 
CTGC-TGATGC 
GAGCCCTTCA 



AGGAGACTGC 
GACGCCATTG 
CCTCCAGGAT 
GTGTTGAGAC 
TCCAACGTCA 
CCAGGCTGGT 
CAGTCAAGGA 
CCCCCCAAGG 
CCCCGGCCAG 
CCCACATTGC 
ACTGGAAAGT 
TGCTATCGTC 
CTGACTATCC 



CTCTGGCAAG 
AACCCCCAAC 
GTTTACAAAA 
TGGTGTCATC 
CCACTGAAGT 
TACCCTGGTG 
AGTCCGCCGT 
GCTGCGAATC 
GTTGGCGCTG 
TTGCAAGTTC 
CTGTTGAGAA 
AAGATGGTTC 
CCCT 



TCTTCTGGTA 
CCGTCCTACC 
TCTCTGGTAT 
AAGCCCGGTA 
CAAGTCCGTC 
ACAACGTCGG 
GGCAACGTTG 
CTTCAATGCC 
GTTATGCCCC 
TCTGAACTCA 
CAACCCCAAG 
CCTCCAAGCC 



AAACCCTTCT 550 

GATAAGCCTC 600 

TGGCACTGTT 650 

TGGTCGTGAC 700 

GAGATGCACC 750 

CTTCAACGTC 800 

CTGGCGACTC 850 

CAGGTCATCG 900 

AGTCCTCGAC 950 

TTGAGAAGAT 1000 

TTCATCAAGT 1050 

CATGTGCGTG 1100 
1124 



15 



2) INFORMATION FOR SEQ ID NO: 1269 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1043 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton rubrum 

(B) STRAIN: WSA-224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269 



GTGAGCGTGG 
AAGTACAATG 

35 TGGTTGTACC 
ATCAAGAACA 
CATTGCTGCC 
AGACCCGTGA 
ATCGTTGCCA 

4 0 CAAGGAAATT 
ACCCCAAGGG 
ATGATTGAGG 
GACCAAGGCC 
ATGCCATCGA 

4 5 CTCCAGGATG 
TGTTGAGACC 
CCAACGTCAC 
CAGCAGGGTG 
CGTCAAGGAA 

50 CACCATCCGG 
CCCGGCCAGA 
TCACATTGCT 
CCGGTAAATC 



TATCACCATC 
TCACCGTCAT 
ATATCTAACG 
TGATCACTGG 
GGTACTGGTG 
GCACGCTCTG 
TCAACAAGAT 
ATCAAGGAAG 
TGTTCCATTC 
CCTCCACCAA 
GGTGGTGCCA 
CATGCCAACC 
TCTACAAGAT 
GGTATCATCA 
CACTGAAGTC 
TCCCCGGTGA 
GTCCGCCGTG 
CTGTGCCTCC 
TCGGTGCTGG 
TGCAAGTTCG 
CGTCGAAGCC 



GATATCGCCC 
TGGTATGTTT 
AGAGTAGACG 
TACCTCCCAG 
AGTTCGAGGC 
CTCGCCTTCA 
GGACACCACC 
TCACCAACTT 
GTTCCAATCT 
CTGCCCATGG 
AGTCCGGCAA 
CGTCCTACCG 
CTCTGGTATC 
AGCCCGGTAT 
AAGTCCGTYK 
CAACGTCGGC 
GTAACGTTGC 
TTCAACGCCC 
TTACGSTCCA 
CTGAGCTCCT 
AACCCCAAGT 



TCTGGAAGTT 


CGAGACCCCC 


50 


CTTTGCCTTG 


TTCCCTCATG 


100 


CCCCCGGTCA 


CCGTGACTTC 


150 


GCTGACTGCG 


CTATTCTCAT 


200 


TGGTATCTCC 


AAGGATGGCC 


■ 250 


CCCTCGGTGT 


CAAGCAGCTC 


300 


GGCTGGTCCG 


AGGATCGTTT 


350 


CATCAAGAAG 


GTTGGCTACG 


400 


CTGGTTTCAA 


CGGTGACAAC 


450 


TACAAGGGAT 


GGAACAAGGA 


500 


GACCCTCCTC 


GAGGCCATCG 


550 


ACAAGCCCCT 


CCGTCTCCCA 


600 


GGAACTGTGC 


CAGTCGGTCG 


650 


GGTCGTCACC 


TTNGCCCCCG 


700 


AAATGCACCA 


CCAGCAGCTT 


750 


TTCAATGTCA 


AGAACGTTTC 


800 


CGGTGACTCC 


AAGAACGACC 


850 


AGGTCATYGT 


CCTCAACCAC 


900 


GTCCTCGACT 


GCCACACTGS 


950 


CGAGAAC-ATT 


GACCGCCGTA 


1000 


TCGTCAAGTC 


TGG 


1043 



686 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1270 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporvm canis 

(B) STRAIN: WSA-217 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270 

GCTGAGCGTG AGCGTGGTAT CACCATTGAT ATCGCCCTCT GGAAGTTCGA 50 

GACCCCCAAG TACATGGTCA CCGTCATCGG TATGCTTTAT CTGTTTCCCA 100 

TTTATAGTTG CGACCAGTAA CTAACAAAAA GTAGATGCCC CCGGGCACCG 150 

20 TGACTTCATC AAGAACATGA TTACTGGTAC CTCCCAGGCC GACTGCGCTA 200 

TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CTCGTGAGCA CGCCCTGCTC GCTTTCACCC TCGGTGTCAA 300 

GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC TGGTCTGAGT 350 

CCCGTTTCGG TGAAATCATC AAGGAAGTCA CCAACTTCAT CAAGAAGGTC 400 

2 5 GGCTACGACC CCAAGGGTGT CCCATTCGTC CCAATCTCTG GCTTCAACGG 450 

TGACAACATG ATTGAGCCCT CCACCAACTG CCCATGGTAC AAGGGATGGA 500 

ACAAGGAGAC CAAGGCCGGT GGCAAATCCT CTGGTAAGAC CCTCCTTGAG 550 

GCCATCGATG CCATTGACAT GCCCACTCGT CCCACCGACA AGCCTCTCCG 600 

TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA ACAGTACCAG 650 

3 0 TCGGTCGTGT TGAGACTGGT ATCATCAAGC CTGGTATGGT TGTCACTTTY 700 

GCCCCCGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA TGCACCACCA 750 

GCAGCTYGTC CAGGGTGTTC CCGGTGACAA CGTTGGCTTC AACGTCAAGA 800 

ACGTYTCTGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG TGATTCCAAG 850 

AACGACCCAC CAGCTGGCTG CGCCTCTTTC AAGGCCCAGG TCATCGTCCT 900 

3 5 CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC CTTGACTGCC 950 

ACACTGCCCA CATTGCTTGC AAGTTCTCTG AGCTTCTTGA GAAGATTGAC 1000 

CGCCGTACTG GTAAATCCGT CGAAACCAGC CCTAAGTTCG TCAAGTCTGG 1050 

TGATGCCGCT ATTGCCACCA TGGTTCCATC CAAGCCCATG TGCGTTGAGG 1100 

CTTTC 1105 

40 



2) INFORMATION FOR SEQ ID NO: 1271 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271 

5 

GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA AGTTCCAGAC 50 

CCCTAAGTAT GAGGTCACCG TCATTGGTAT GTTGTCCTTC TTGTGTTACC 100 

ATCGAAACAT ATCTAACCTA CAACTGCAGA CGCCCCCGGT CACCGTGACT 150 

TCATCAAGAA CATGATCACT GGTACCTCCC AGGCCGACTG CGCTATTCTC 200 

10 ATCATTGCTT CCGGTACTGG TGAATTCGAG GCTGGTATCT CCAAGGATGG 250 

CCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTCGGT GTCCGTCAGC 300 

TCATCGTTGC CCTCAACAAG ATGGACACTG CTGGCTGGGC TGAGGCTCGT 350 

TACAACGAAA TCGTCAAGGA AACTTCCGGT TTCATCAAGA AGGTCGGCTA 400 

CAACCCCAAG TCGGTTCCCT TCGTCCCCAT CTCCGGTTTC AACGGTGACA 450 

15 ACATGCTTGA GCCCTCCTCC AACTGCCCCT GGTACAAGGG TTGGGAGAAG 500 

GAGACCAAGG CTGGTAAGGC CACTGGTAAG ACCCTCCTCG AGGCCATCGA 550 

CGCCATTGAG CCTCCCGTCC GTCCCTCCAA CAAGCCTCTC CGTCTTCCCC 600 

TCCAGGATGT CTACAAGATC TCTGGTATTG GAACTGTCCC CGTCGGCCGT 650 

GTCGAGACCG GTACCATCGT CCCCGGTATG GTCGTCACCT TCGCTCCCGC 700 

20 CAACGTCACC ACTGAAGTCA AGTCCGTTGA GATGCACCAC CAGCAGCTCA 750 

AGGAGGGTGT TCCCGGTKAC AACGTTGGTT TCAACGTGAA GAACGTTTCC 800 

GTCAAGGAAG TCCGCCGTGG TAACGTCGCT GGTGACTCCA AGAACGACCC 850 

CCCTGCCGGT GCTGCCTCTT TCACCGCCCA GGTCATCGTC CTCAACCACC 900 

CCGGTCAGGT CGGCGCTGGT TACGCTCCCG TCCTCGACTG CCACACCGCT 950 

25 CACATTGCCT GCAAGTTCGC TGAGCTCCAG GAGAAGATCG ACCGCCGTAC 1000 

CGGAAAGTCT GTCGAATYTG CCCCCAAGTT CATCAAGTCT GGTGACGCCG 1050 

CTATCGTCAA GATGATTCCC TCCAAGCCCA TGTGTGTCGA GTCTTTCACT 1100 

GACTACCCTC CTYTCGGCCG TTTCGCCGTC CGTGACGTAA GTTCTTTCCC 1150 

CAGCTTTTCG ATGCTACCCT TCTMTGAATC ACGTGTCATG TCTTGGCACC 1200 

30 CGCCCATCAC ATGACCACGC AACCCTATAC CCCGCCACAC CCTT 124 4 



2) INFORMATION FOR SEQ ID NO: 1272 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Exophiala moniliae 

(B) STRAIN: WSA-219 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272 

50 GCTGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACTATGTCA CCGTCATCGA CGCCCCCGGT CATCGTGACT 100 

TCATCAAGAA CATGATCACT GGTACTTCCC AAGCTGACTG CGCCATTCTC 150 

ATCATTGCTG CCGGTACTGG TGAATTCGAA GCCGGTATCT CCAAGGATGG 200 
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TCAGACCCGT GAGCACGCTC TGCTTGCCTA CACCCTGGGT GTCAAGCAGC 250 

TCATTGTCGC CATCAACAAG ATGGACACTA CCAAGTGGTC TGAGGACCGT 300 

TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA AGGTCGGCTA 350 

CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC AACGGTGACA 400 

5 ACATGATCGA CGTCTCCACC AACTGCCCCT GGTACAAGGG CTGGGAGAAG 4 50 

GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTTG AGGCCATCGA 500 

CGCCATTGAC CCCCCCTCTC GTCCCACCGA CAAGCCTYTC CGTCTCCCTC 550 

TCCAGGATGT GTACAAGATC TCTGGTATCG GAACGGTGCC CGTCGGTCGT 600 

GTCGAGACTG GTATCATCAA GGCCGGTATG GTCGTTACCT TCGCTCCTGC 650 

10 CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC GAACAGCTCG 7 00 

CCGAGGGTGT TCCAGGTGAC AACGTCGGTT TCAACGTCAA GAACGTYTCC 750 

GTCAAGGAGG TTCGTCGTGG AAACGTTTGC GGTGACTCCA AGAACGACCC 800 

ACCCAAGGGC GCTGATTCCT TCAACGCCCA GGTCATCGTC TTGAACCACC 850 

CTGGTCAAGT TGGTGCTGGC TACGCCCCAG TGTTGGATTG CCACACTGCC 900 

15 CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATCG ATCGTCGKAC 950 

CGGAAAGTCG ATCGAAAACA ACCCCAAGTT CATCAAGTCT GGTGACGCTG 1000 

CCATCGTCAA GATGGTTCCC AGCAAGCCCA TG 1032 



20 



2) INFORMATION FOR SEQ ID NO: 1273 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hortaea werneckii 

(B) STRAIN: ATCC 34944 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273 

TAACAGCCCG TGCGACCGTC TCGTCCAATA ACTAATGATT CGCAGGTACT 50 

ATGTCGTGAG TATCCGGTCC TTTTTTGTTA ATTTACCAGA AATGACGAAA 100 

TTTTTGACTA ATTAACACAC TCAGACCGTC ATTGACGCCC CGGGTCACCG 150 

4 0 TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCC GACTGCGCTG 200 

TCCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA 300 

GCAGCTCATC GTCGCCATCA ATAAGATGGA CACCACCAAG TGGTCCGAGG 350 

AGCGTTACGG CGAGATCATC AAGGAGACCT CTGCCTTCAT CAAGAAGGTC 400 

4 5 GGTTTCAACC CGAAGCACGT CCCGTTCGTC CCGATCTCCG GTTTCAACGG 450 

TGACAACATG ATCGAGGCCT YCACCAACTG CCCGTGGTAC AAGGGCTGGG 500 

AGAAGGAGAC CAAGGCCAAG GTCACCGGCA AGACCCTYCT TGAGGCCATT 550 

GACAACATCG ACCCGCCGAG CCGTCCTTCC GACAAGCCGC TCCGTCTTCC 600 

CCTCCAGGAT GTCTACAAGA TCGGTGGTAT TGGGACAGTC CCAGTCGGCC 650 

50 GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTTAC CTTCGCTCCG 700 

GCTGGTGTCA CCACTGAAGT GAAGTCCGTT GARATGCACC ACGAGCAGCT 750 

CGCTGAGGGT YTGCCGGGTG ACAACGTCGG CTTCAACGTC AAGAACGTTT 800 

CCGTCAAGGA GATCCGTCGT GGCAACGTTG CTGGTGACAG CAAGGCTGAC 850 
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CCGCCGAAGG GCTGCGACAG 
CCCTGGCCAG GTCGGTGCTG 
CCCACATTGC CTGCAAGTTC 
TYTGGCAAGT CCATTGAAGC 
5 TGCCATYGTC AAGATGATTC 
CTGAGT 



CTTCAACGCC CAGGTCATCG 
GTTACGCTCC AGTCCTGGAC 
GGCGAGCTCC TCGAGAAGAT 
CTYGCCTAAG TACATCAAGT 
CGTCCAAGCC GATGTGCGTT 



PCT/CAOO/01150 

TCCTGAACCA 900 
TGCCACACTG 950 
CGACCGTCGC 1000 
CTGGTGACGC 1050 
GAGCCATTCA 1100 
1106 



10 2) INFORMATION FOR SEQ ID NO: 1274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 bases 

(B) TYPE: • Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274 



CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCCGCTACT ATGTCACCGT CATTGGTATG TTGCTGTCAC 100 

CTCTCTCACA CATGTCTCAC CACTAACAAT CAACAGACGC CCCCGGCCAC 150 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGACTGCGC 200 

30 CATTCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 250 

AGGATGGCCA GACCCGTGAG CACGCCCTGC TCGCCTACAC CCTCGGTGTC 300 

AAGAACCTCA TTGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 350 

GTCCCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 400 

TCGGCTACAA CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC 4 50 

35 GGCGACAACA TGCTTACTCC CTCCACCAAC TGCCCCTGGT ACAAGGGCTG 500 

GGAGCGTGAG ATCAAGTCCG GCAAGCTCAC TGGCAAGACC CTCCTCGAGG 550 

CCATTGACTC CATCGAGCCC CCCAAGCGTC CCGTCGACAA GCCCCTYCGA 600 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATTGGCA CGGTTCCCGT 650 

CGGCCGTATC GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTMG 700 

40 CCCCCTCCAA CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG 750 

CAGCTCTYTG AGGGTCTTCC CGGTGACAAC GTCGGCTTCA ACGTGAAGAA 800 

CGTYTCCGTC AAGGAGATCC GACGTGGCAA CGTCGCTGGT GACTCCAAGA 850 

ACGACCCCCC TYTGGGTGCC GCCTCTTTCA CCGCCCAGGT CATTGTCCTC 900 

AACCACCCTG GCCAGGTCGG TGCCGGTTAC GCCCCCGTTY TGGACTGCCA 950 

45 CACTGCCCAC ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC 1000 

GCCGAACTGG TAAGGCTGTT GAGTCCGCCC CCAAGTTCAT CAAGTCTGGT 1050 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1100 

TTTCACTGAC TACCCCCCT 1119 



2) INFORMATION FOR SEQ ID NO: 1275 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1275 

15 CTGAAGTCTG AGCGTGAGCG TGGTATCACT ATCGATATCG CCCTCTGGAA 50 

GTTCGAGACC CCCAAGTACA TGGTCACCGT CATCGATGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC TGACTGCGCC 150 

ATTCTCATCA TTGCTGCCGG TACTGGTGAG TTCGAGGCTG GTATCTCCAA 200 

GGATGGCCAG ACTCGTGAGC ACGCTCTCCT CGCCTACACC CTTGGTGTCA 250 

20 AGCAGCTCAT CGTTGCCATC AACAAGATGG ACACCACCAA GTGGTCTGAG 300 

GCCCGTTACC AGGAGATCAT CAAGGAGACC TCCGGTTTCA TCAAGAAGGT 350 

CGGCTACAAC CCCAAGCACG TTCCCTTCGT CCCCATCTCC GGCTTCAACG 4 00 

GAGACAACAT GATCGAGGTC AGCTCCAACT GCCCTTGGTA CAAGGGTTGG 450 

GAGAAGGAGA CCAAGGCCAA GGCCACTGGC AAGACCCTCC TCGAGGCCAT 500 

25 TGACGCCATC GACCCTCCTT CCCGTCCTAC CGACAAGCCC CTCCGTCTTC 550 

CCCTCCAGGA TGTCTACAAG ATCGGTGGTA TTGGCACGGT GCCCGTCGGT 600 

CGTGTTGAGA CCGGTAAGAT CATGGGTGGT ATGGTTGTCA CCTTCGCCCC 650 

CGCTGGTGTC ACCACCGAGG TCAAGTCCGT CGAGATGCAC CACGAGCAGC 700 

TCACCGAGGG TCTT.CCCGGT GACAACGTCG GCTTCAACGT CAAGAACGTY 7 50 

30 TCCGTCAAGG AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA 800 

CCCCCCCAAG GGTTGTGACT CCTTCAACGC CCAGGTCATC GTCCTGAACC 850 

ACCCTGGTCA GGTCGGTGCT GGTTACGCAC CCGTCCTCGA CTGCCACACT 900 

GCCCACATTG CTTGCAAGTT YTCCGAGCTT GTTGAGAAGA TTGACCGCAG 950 

AACCGGCAAG TCCGTTGAGG CTGCCCCCAA GTTCATCAAG TCTGGTGACG 1000 

35 CCGCCATCGT CAAGATGGTT CCCTCCAAGC CCATGTGTGT TGAGGCTTTC 1050 

ACCGACTACC CTCCTYTCGG TCGTTTCGCC GTCCGGTATG TTTTTCTTCT 1100 

TCAATCATTC TTC 1113 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1276 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 731 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 14112 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276 



GGTCTGAGAC 
AAGAAGGTCG 
TTTCGAGGGT 
AGGGCTGGAA 
CTCGATGCCA 
TCTCCGTCTT 
TTCCCGTTGG 
ACCTTCGCCC 
CCACCAGCAG 
TCAAGAACGT 
TCCAAGAACG 
CGTCCTCAAC 
ACTGCCACAC 
ATTGACCGCC 
GTCCGGTGAT 



CCGTTTCAAC 
GATACAACCC 
GACAACATGA 
CAAGGAGACT 
TTGACGCCAT 
CCCCTCCAGG 
ACGTGTCGAG 
CTGCCAACGT 
CTCCAGGCTG 
TTCCGTCAAG 
ACCCCCCCAA 
CACCCCGGCC 
TGCCCACATT 
GTACCGGAAA 
GCTGCTATCG 



GAAATTATCA AGGAAGTCAG CAACTTCATC 
CAAGTCTGTT CCCTTCGTGC CAATCTCCGG 
TTGAGCCTTC CCCCAACTGC CCTTGGTACA 
GCTGCTGGCA AGGCCGCCGG TAAGACTCTT 
CGACCCCCCC GTCCGTCCTA CCGAGAAGCC 
ATGTGTACAA GATCTCCGGT ATTGGCACTG 
ACTGGTGTCA TCAAGCCTGG TATGGTCGTG 
CACCACTGAA GTCAAGTCCG TTGAAATGCA 
GT'TACCCTGG TGACAACGTC GGCTTCAACG 
GAAGTCCGCC GTGGCAACGT TGCCGGTGAC 
GGGCTGCGAG' TCCTTCAACG CCCAGGTCAT 
AGGTTGGCGC TGGTTATGCC CCAGTCCTCG 
GCTTGCAAGT TCTCTGAGCT CATCGAGAAG 
GTCTGTTGAG GACAACCCCA AGTTCATCAA 
TCAAGATGAT T 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
731 



2) INFORMATION FOR SEQ ID NO: 1277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1046 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala dermatitidis 

(B) STRAIN: ATCC 76088 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277 

CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCCCTCTG GAAGTTCGAG 50 

ACCCCCAAGT ACTATGTCAC CGTCATCGAC GCCCCGGGTC ATCGTGACTT 100 

TATCAAGAAC ATGATCACTG GTACCTCGCA GGCCGACTGC GCCATCTTGA 150 

TCATTGCCGC CGGTACCGGT GAATTCGAAG CCGGTATCTC CAAGGATGGT 200 

CAGACCCGTG AGCACGCTCT GCTCGCCTAC ACCTTGGGTG TCAAGCAGCT 250 

CATCGTCGCC ATCAACAAGA TGGACACCAC CAAGTGGTCC GAGGAGCGTT 300 

TCAACGAAAT CATCAAGGAG ACTTCCAACT TCATCAAGAA GGTCGGCTAC 350 

AACCCCAAGG CCGTTCCTTT CGTCCCCATC TCCGGCTTCA ACGGTGACAA 400 

CATGATTGAG GTCTCCACCA ACTGCCCGTG GTACAAGGGA TGGGAGAAGG 450 

AGTCCAAGGC TGGCAAGGCC ACCGGCAAGA CCCTCCTCGA GGCCATTGAC 500 

GCCATCGACC CACCCACCCG TCCCACCGAC AAGCCTCTCC GTCTCCCTCT 550 

CCAGGATGTC TACAAGATCT CTGGTATCGG AACGGTTCCT GTCGGTCGTG 600 

TCGAGACCGG TACCATCAAG GCCGGTATGG TCGTCACCTT CGCTCCGGCC 650 

AACGTCACCA CTGAAGTCAA GTCCGTCGAA ATGCACCACG AGCAGCTCGC 700 

CGAGGGTYTG CCAGGTGACA ACGTTGGCTT CAACGTCAAG AACGTYTCCG 750 

TCAAGGAGGT TCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCG 800 



692 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCCAAGGGTG CCGAGTCCTT CAACGCCCAG 

TGGTCAGATC GGTGCCGGCT ACGCTCCAGT 

ACATTGCTTG CAAGTTCGCC GAGTTGCTCG 

GGAAAGTCCA TCGAGAACAA CCCCAAGTTC 

5 CATCGTCAAG ATGATTCCCA GCAAGCCCAT 



PCT/CA00/01150 

GTCATTGTCC TCAACCACCC 850 

CTTGGATTGC CACACTGCCC 900 

AGAAGATCGA CCGTCGTACC 950 

ATCAAGTCTG GTGATGCTGC 1000 

GTGTGTCGAG GCTTTC 104 6 



10 



2) INFORMATION FOR SEQ ID NO: 1278 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278 



25 CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT TCGAGACTCC 50 

TCGCTACTAT GTCACCGTCA TTGGTATGTT GTCGCTCATA CCTCATCCTA 100 

CTTCCTCATA CTAACACATC ATTCAGACGC TCCCGGTCAC CGTGATTTCA 150 

TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGATTGCGC CATTCTCATC 200 

ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA AGGATGGCCA 250 

30 GACCCGTGAG CACGCTCTTC TTGCCTACAC CCTTGGTGTC AAGAACCTCA 300 

TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCTGA GGCCCGTTAC 350 

CAGGAGATCA TCAAGGAGAC CTCCTCTTTC ATCAAGAAGG TCGGCTACAA 4 00 

CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC GGTGACAACA 450 

TGCTTACCCC CTCCACCAAC TGCCCCTGGT ACAAGGGTTG GGAGCGTGAG 500 

35 ATCAAGTCCG GCAAGCTCTC CGGCAAGACC CTCCTCGAGG CCATTGACTC 550 

CATCGAGCCT CCCAAGCGTC CCGTTGACAA GCCCCTCCGT CTTCCCCTCC 600 

AGGATGTCTA CAAGATCGGT GGTATTGGAA CGGTTCCCGT CGGCCGTATT 650 

GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTCG CTCCCTCCAA 7 00 

CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG CAGYTCAGTK 750 

4 0 AGGGCCAGCC CGGTGACAAC GTTGGTTTCA ACGTGAAGAA CGTTTCCGTC 800 

AAGGACATCC GACGTGGTAA CGTCGCTGGT GACTCCAAGA ACGACCCCCC 850 

CCAGGGTGCC GCTTCTTTCA CCGCCCAGGT CATCGTCCTC AACCACCCCG 900 

GCCAGGTCGG TGCTGGTTAC GCTCCTGTCC TCGATTGCCA CACTGCCCAC 950 

ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC GCCGAACCGG 1000 

4 5 TAAGGCTACT GAGGCCGCTC CCAAGTTCAT CAAGTCTGGT GACTCCGCCA 1050 

TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GTGTCGAGGC TTTCACTGAC 1100 

TACCCTCCT 1109 



50 

2) INFORMATION FOR SEQ ID NO: 1279 

(i) SEQUENCE CHARACTERISTICS: 

693 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(A) LENGTH: 765 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ 

CCTGCAAGTG GTCTGAGGAC CGTTACAACG 
AACTTCATCA AGAAGGTCGG CTACAACCCC 
CATCTCCGGT TTCAACGGTG ACAACATGCT 
CCTGGTACAA GGGTTGGGAG AAGGAGGGCA 
AAGACTCTCC TCGAGGCCAT CGATGCCATC 
CAACAAGCCT CTCCGTCTTC CCCTCCAGGA 
TCGGAACTGT CCCCGTCGGC CGTGTCGAGA 
ATGGTTGTTA CCTTCGCTCC TTCCAACGTC 
TGAGATGCAC CACCAGCAGC TCAAGGAGGG 
GTTTCAACGT CAAGAACGTC TCCGTCAAGG 
GCTGGTGACT CCAAGAACGA CCCCCCTGCT 
CCAGGTCATC GTTCTCAACC ACCCCGGTCA 
CCGTCCTCGA CTGCCACACT GCCCACATTG 
CAGGAGAAGA TTGACCGCCG TACCGGAAAG 
GTTCATCAAG TCTGGTGATG CTGCCATCGT 
CCATGTGCGT CGAAG 



PCT/CAOO/01150 



terreus 



ID NO: 1279 



AAATCGTGAA 


GGAGACCTCC 


50 


AAGGCCGTTC 


CCTTCGTCCC 


100 


TGAGCCTTCC 


CCCAACTGCC 


150 


AGTCCGGCAA 


GGTCACCGGT 


200 


GAGCCCCCCG 


TCCGTCCTGC 


250 


TGTGTACAAG 


ATCTCTGGTA 


300 


CTGGTGTCAT 


CACCCCCGGC 


350 


ACCACTGAAG 


TGAAGTCCGT 


400 


TCTCCCCGGT 


GACAACGTTG 


450 


AGGTCCGTCG 


TGGTAACGTC 


500 


GGCGCTGCCT 


CCTTCACCGC 


550 


GGTCGGCGCT 


GGCTACGCCC 


600 


CCTGCAAGTT 


CGCTGAGCTC 


650 


TCTGTTGAGT 


CTTCTCCCAA 


700 


CAAGATGATC 


CCTYCAAAGC 


750 






765 



2) INFORMATION FOR SEQ ID NO: 1280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280 

GCGTGGTATC ACCATCGACA TTGCCCTCTG GAAGTTCCAG ACTCCCAAGT 50 

ATGAGGTCAC TGTCATCGGT AAGCTCGACT CGCCCCGATA TGTTTTGGTG 100 

CTGTAGCTAA CACGATCTGA AGATGCCCCC GGTCACCGTG ACTTCATCAA 150 

GAACATGATC ACTGGTACCT CCCAGGCTGA CTGCGCTATC CTCATCATTG 200 

CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACC 250 

694 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



CGTGAGCACG CTCTGCTGGC TTTCACCCTC GGTGTCAAGC AGCTCATCGT 300 

CGCCCTCAAC AAGATGGACA CCTGCAAGTG GTCCGAGGAT CGTTACAACG 350 

AAATTGTCAA GGAAACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 400 

AAGGCCGTTC CCTTCGTMCC CATCKCTGGC TTCAACGGTG ACAACATGCT 450 

5 TGAGCCCTCC TCCAACTGCC CCTGGTACAA GGGATGGGAG AAGGAGACCA 500 

AGGCCGGCAA GGTCACTGGT AAGACCCTCA TCGAGGCCAT YGACGCCATT 550 

GAGCCCCCTG TCCGTCCCTC CAACAAGCCC CTYCGTCTTC CCCTCCAGGA 600 

TGTYTACAAG ATCTCTGGTA TCGGAACGGT CCCTGTCGGC CGTGTCGAGA 650 

CCGGTATCAT CAAGCCCGGC ATGGTCGTCA CCTTTGCCCC CGCCAACGTC 700 

10 ACCACTGAAG TCAAGTCCGT CGAAATGCAC CACCAGCAGC TCCAGGAGGG 750 

TGTCCCCGGT GACAACGTCG GTTTCAACGT CAAGAACGTT TCCGTCAAGG 800 

AAGTCCSCCG TGGTAACGTT TGCGGTGACT CCAAGAACGA TCCCCCTCAG 850 

GGTGCTGCCT CCTTCAACGC CCAGGTCATC GTCCTCAACC ACCCCGGTCA 900 

GGTCGGCGCT GGTTACGCCC CCGTCCTCGA CTGCCACACT GCCCACATTG 950 

15 CTTGCAAGTT CTYTGAGCTG CTTGAGAAGA TTGACCGCCG TACCGGCAAG 1000 

TCTGTTGAGA ACAACCCCAA GTTCATCAAG TCCGGTGAWG CCGCCATCGT 1050 

GAAGATGGTT CCTTCCAAGC CCATGTGTGT CGAGTCCTTC ACTGACTACC 1100 

CCCCT 1105 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1281 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus laurentii 

(B) STRAIN: ATCC 44096 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281 



GCCGAGCGAG AGCGAGGTAT CACCATCGAC ATTGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAACGTCA CCGTCATTGA CGCCCCCGGA CACCGAGACT 100 

40 TCATCAAGAA CATGATCACC GGTACCTCCC AGGCCGACTG CGCCATCCTT 150 

ATCATTGCCA CCGGTATCGG AGAGTTCGAG GCCGGTATCT CTAAGGACGG 200 

TCAGACCCGA GAGCACGCTT TGCTCGCCTT CACCCTCGGT GTCAGGCAGC 250 

TCATCGTTGC TTGCAACAAG ATGGACACCT GCAAGTGGTC CGAGGACCGA 300 

TTCAACGAGA TCGTCAAGGA GACCAACGGT TTCATCAAGA AGGTCGGATA 350 

45 CAACCCCAAG GCTGTTCCCT TCGTCCCCAT CTCTGGATGG CACGGAGACA 4 00 

AGTGAGTGCC GTTCTTTGCG TTGAGCCCTC TTTGTCGCTC CCCCTCCCTC 4 50 

TCAAGTGGCG GCGGCGGTCT CCACCCACAA ATCGGGTGGC GAATCCGCCA 500 

CACCCACCAC TTCTCGCCAC CGAGTGTGGC ACTTCTTCCA ACTCCTCTTT 550 

CCACTCCTCC TCGTCTCGTC TCTTTTTTTC TCCGTTGTCT TTGACAAGGG 600 

50 GAGTGTGCTG ATAGTAAAGC ATGCTTGAGG AGACCACCAA CATGCCGTGG 650 

TACAAGGGAT GGACCAAGGA GACCAAGTCC GGTGTCGTTA AGGGTAAGAC 700 

CCTCCTCGAC GCCATCGACG CCATCGAGCC TCCTCAACGA CCCACCGACA 750 

AGCCCCTCCG ACTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATCGGT 800 

695 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ACGGTGCCCG TCGGCCGAGT 
CACGGGAGCC TGACGGCTGT 
GTCGTCACCT TCGCTCCTAC 
GATGCACCAC GAGCAGATCC 
5 TCAACGTGAA GAACGTTTCC 
TCCGACTCCA AGAACGACCC 
GGTCATTGTC CTCAACCACC 
TCCTCGACTG CCACACCGCC 
GAGAAGATCG ACCGACGAAC 
10 CGTCAAGTCC GGAGACGCCG 
TCTGTGTCGA GTCTTACTCT 



CGAGACCGGT GTCATCAAGG 
TGTTGCCACA CCCAACTTAT 
CAACGTCACC ACTGAGGTTA 
CTGAGGGTCT TCCCGGAGAC 
ATCAAGGACA TCCGACGAGG 
CGCTAAGGAG GCCGCTTCTT 
CTGGACAGAT TGGTGCCGGT 
CACATTGCCT GCAAGTTCGC 
TGGTAAGACC ATGGAGGCCG 
CCATTGTCAA GCTCGTTGCC 
GACTACCCTC CCCTTGGACG 



PCT/CAOO/01150 

CCGGGTAAGT 850 

ATCCAGTATG 900 

AGTCCGTTGA 950 

AACGTTGGTT 1000 

AAACGTYTGC 1050 

TCAACGCCCA 1100 

TACACCCCCG 1150 

CGAGCTCATC 1200 

CCCCCAAGTT 1250 

CAGAAGCCCA 1300 

ATT 134 3 



15 2) INFORMATION FOR SEQ ID NO: 1282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282 



TGGTCCGAGG CTCGTTTCAA CGAAATCATC AAGGAAGTCA CCAACTTCAT 50 

CAAGAAGGTC GGATACAACC CCAAGTCTGT TCCCTTCGTG CCAATCTCTG 100 

GTTTCGAGGG TGATAACATG ATTGAGCCCT CCACCAACTG CCCCTGGTAC 150 

AAGGGCTGGA CCAAGGAGAC CGCTGCCGGC AAGTCAACCG GTAAGACTCT 200 

3 5 TCTTGACGCC ATTGATGCCA TCGATCAACC CTCCCGTCCT ACCGACAAGC 250 

CCCTCCGTCT TCCCCTCCAG GATGTGTACA AGATCTCCGG TATTGGCACT 300 

GTTCCCGTTG GACGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTCGT 350 

GACCTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 4 00 

ACCACCAACA GCTCTTGGCT GGTAACCCCG GTGACAACGT CGGTTTCAAC 4 50 

40 GTCAAGAACG TTTCCGTCAA GGAAGTCCGC CGTGGCAACG TTGCTGGTGA 500 

CTCAAAGAAC GACCCCCCCA AGGGCTGCGA CTCCTTCAAC GCCCAGGTCA 550 

TCGTCCTCAA TCACCCCGGT CAAGTTGGCG CTGGTTATGC CCCAGTCCTC 600 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TCCTCGAGAA 650 

GATTGACCGC CGTACCGGAA AGTCCACTGA GAACAACCCC AAGTTCATCA 700 

45 AGTCTGGTGA CGCCGCTATC GTCAAGATGG TTCC 734 



2) INFORMATION FOR SEQ ID NO: 1283 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

696 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 62877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1283 



CGTGAGCGTG GTATCACCAT 
CCGCTACTAT GTCACCGTCA 
TGTCTCACCA CTAACAATCA 
AAGAACATGA TCACTGGTAC 
TGCCGCTGGT ACTGGTGAGT 
CCCGTGAGCA CGCCCTGCTC 
GTCGCCATCA ACAAGATGGA 
GGAGATCATC AAGGAGACCT 
CCAAGGCTGT CGCTTTCGTC 
CTTACTCCCT CCACCAACTG 
CAAGTCCGGC AAGCTCACTG 
TCGAGCCCCC CAAGCGTCCC 
GATGTCTACA AGATCGGTGG 
GACTGGTGTC ATCAAGCCCG 
TCACCACTGA AGTCAAGTCC 
GGTCTTCCCG GTGACAACGT 
GGAGATCCGA CGTGGCAACG 
TGGGTGCCGC CTCTTTCACC 
CAGGTCGGTG CCGGTTACGC 
TGCCTGCAAG TTCGCCGAGA 
AGGCTGTTGA GTCCGCCCCC 
GTCAAGATGG TTCCCTCCAA 
CCCCCCT 



CGACATTGCC CTCTGGAAGT 
TTGGTATGTT GCTGTCACCT 
ACAGACGCCC' CCGGCCACCG 
TTCCCAGGCC GACTGCGCCA 
TCGAGGCTGG TATCTCCAAG 
GCCTACACCC TCGGTGTCAA 
CACCACCAAG TGGTCCGAGT 
CCAACTTCAT CAAGAAGGTC 
CCCATCTCCG GTTTCAACGG 
CCCCTGGTAC AAGGGCTGGG 
GCAAGACCCT CCTCGAGGCC 
GTCGACAAGC CCCTCCGACT 
TATTGGCACG GTTCCCGTCG 
GTATGGTCGT TACCTTCGCC 
GTCGAGATGC ACCACGAGCA 
CGGCTTCAAC GTKAAGAACG 
TCGCTGGTGA CTCCAAGAAC 
GCCCAGGTCA TTGTCCTCAA 
CCCCGTTYTG GACTGCCACA 
TCCAGGAGAA GATCGACCGC 
AAGTTCATCA AGTCTGGTGA 
GCCCATGTGC GTTGAGGCTT 



TCGAGACTCC 


50 


CTCTCACACA 


100 


TGACTTCATC 


150 


TTCTCATCAT 


200 


GATGGCCAGA 


250 


GAACCTCATT 


300 


CCCGTTACCA 


350 


GGCTACAACC 


400 


CGACAACATG 


450 


AGCGTGAGAT 


500 


ATTGACTCCA 


550 


TCCCCTCCAG 


600 


GCCGTATCGA 


650 


CCCTCCAACG 


700 


GCTCTYTGAG 


750 


TYTCCGTCAA 


800 


GACCCCCCTY 


850 


CCACCCTGGC 


900 


CTGCCCACAT 


950 


CGAACTGGTA 


1000 


CTCCGCCATC 


1050 


TCACTGACTA 


1100 




1107 



2) INFORMATION FOR SEQ ID NO: 1284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284 



697 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

GCTCAAGGCC GAGCGTGAGC GCGGTATCAC CATCGATATT GCTCTGTGGA 50 

AGTTCGAGAC CCCCAAGTAC TACGTCACCG TCATTGACGC CCCCGGTCAT 100 

CGCGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

5 AGGATGGCCA GACTCGTGAG CACGCTCTGC TCGCCTACAC CCTGGGTGTG 250 

CGGCAGCTGA TCGTCGCCAT CAACAAGATG GACACGGCCA AGTGGGCTGA 300 

GGCTCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGACT GTTGCCTTCG TCCCCATCTC GGGCTTCCAC 400 

GGCGACAACA TGCTTACTCC CTCGACCAAC TGCCCCTGGT ACAAGGGCTG 4 50 

10 GGAGAAGGAG GGCAAGAGCG GCAAGGTTAC CGGTAAGACT CTGCTGGACG 500 

CCATTGACGC CGTCGAGCCC CCCAAGCGCC CCACGGACAA GCCCCTGCGT 550 

CTGCCCCTCC AGGATGTCTA CAAGATCGGC GGTATCGGCA CTGTCCCTGT 600 

CGGCCGTATC GAGACTGGTG TCCTGAAGCC CGGCATGGTC GTCACCTTTG 650 

CCCCGTCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

15 CAGCTTGTTG AGGGTGTTCC CGGCGACAAC GTCGGCTTCA ACGTCAAGAA 750 

CGTYTCCGTC AAGGAGATCC GTYGTGGCAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCCCC CTCGGGCGCC GCCACCTTCA ACGCCCAGGT CATTGTCCTG 850 

AACCACCCCG GCCAGGTCGG CAACGGCTAC GCCCCGGTTY TGGACTGCCA 900 

CACCGCCCAC ATTGCCTGCA AGTTCACCGA GATCCTTGAG AAGATCGACC 950 

2 0 GCCGTACCGG CAAGTCGGTT GAGAACAACC CCAAGTTCAT CAAGTCGGGT 1000 

GACGCCGCCA TTGTCAAGCT GACGCCCTYG AAGCCCATGT GCGTT 1045 



25 2) INFORMATION FOR SEQ ID NO: 1285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285 



TTGCCAGTGG TCCGAGGCCC GTTACAACGA 
GTTTCATCAA GAAGGTCGGA TACAACCCCA 
ATCTCCGGTT TCAACGGTGA CAACATGCTC 
CTGGTACAAG GGTTGGGAGA AGGAGACCAA 

4 5 AGACCCTCCT TGAGGCCATC GACGCCATTG 
AACAAGCCCC TCCGTCTTCC CCTCCAGGAT 
TGGAACTGTC CCCGTCGGCC GTGTCGAGAC 
TGGTCGTCAC CTTCGCTCCT GCCAACGTCA 
GAGATGCACC ACCAGCAGCT CAAGGAGGGT 

50 TTTCAACGTC AAGAACGTTT CCGTCAAGGA 
CCTCCGACTC CAAGAACGAC CCCGCCTCCG 
CAGGTCATCG TTCTCAACCA CCCCGGTCAG 
CGTCCTCGAC TGCCACACCG CCCACATTGC 

698 



AATCGTCAAG 


GAGACTTCCG 


50 


AGTCCGTTGC 


CTTCGTCCCC 


100 


GAGGCCTCTA 


CCAACTGCCC 


150 


GGCCGGTAAG 


GCCACTGGTA 


200 


AGCCCCCCAC 


CCGTCCCTCC 


250 


GTCTACAAGA 


TCTCCGGTAT 


300 


TGGTGTTATC 


ACCCCCGGCA 


350 


CCACTGAAGT 


CAAGTCCGTT 


400 


GTCCCCGGTG 


ACAACGTCGG 


450 


AATCCGTCGT 


GGTAACGTTG 


500 


GCGCTGCCTC 


TTTCAACGCC 


550 


GTCGGTGCTG 


GTTACGCCCC 


600 


TTGCAAGTTC 


TCTGAGCTTC 


650 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTGAGAAGAT TGACCGCCGT ACCGGAAAGG CTGTTGAAAC CAGCCCCAAG 
TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC CTTCCAAGCC 
CATGTGCGKT CCGA 



700 
750 
764 



2) INFORMATION FOR SEQ ID NO: 1286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 16264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286 

ACCATCGATA TCGCGCTCTG GAAGTTCGAG ACTCCCAAGT ACTTCGTCAC 50 

CGTCATCGAT GCCCCTGGTC ATCGTGACTT CATCAAGAAC ATGATCACTG 100 

GTACCTCCCA GGCTGATTGT GCTATTCTCA TCATTGCCGC TGGTACTGGT 150 

GAGTTCGAGG CCGGTATCTC CAAGGATGGC CAGACCCGTG AGCATGCTCT 200 

GCTCGCCTAC ACCCTGGGCG TGAAGCAGCT TATCGTCGCC ATCAACAAGA 250 

TGGACACCAC CAAATGGTCT GAGGATCGTT TCAACGAAAT CATCAAGGAG 300 

ACTTCCAACT TCATCAAGAA GGTCGGATAC AACCCCAAGT CCGTTCCATT 350 

CGTGCCCATC TCCGGCTTCA ACGGTGACAA CAT GAT CG AC GTCTCCACCA 4 00 

ATGCCCCCTG GTACAAGGGC TGGGAAAAGG AGTCCAAGGC TGGCAAGGCC 4 50 

ACCGGCAAGA CCCTCCTTGA GGCTATCGAC TCCATCGACC CTCCTGCTCG 500 

TCCCACCGAC AAGCCTCTCC GTCTCCCACT CCAGGATGTC TACAAGATTT 550 

CTGGTATCGG CACGGTGCCC GTCGGTCGTG TTGAGACTGG TACCATCAAG 600 

GCCGGTATGG TCGTCACCTT TGCCCCCGCC AACGTCACCA CTGAAGTCAA 650 

GTCCGTCGAA ATGCACCACG AACAGCTYGC CGAGGGCGTT CCGGGTGACA 700 

ACGTCGGCTT CAACGTCAAG AACGTYTCCG TGAAGGAGGT TCGTCGTGGA 750 

AACGTTGCTG GTGACTCCAA GAACGACCCC CCCAAGGGTG CCGACTCCTT 800 

CAACGCCCAG GTCATCGTCC TCAACCACCC TGGTCAGGTC GGTGCTGGCT 850 

ACGCCCCGGT CTTGGATTGC CACACTGCCC ACATTGCCTG CAAGTTCTYT 900 

GAGCTCCTCG AGAAGATCGA TCGTCGKACC GGCAAGTCCA TGGAAAACAA 950 

CCCCAAGTTC ATCAAGTCTG G 971 



2) INFORMATION FOR SEQ ID NO: 1287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



699 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 
5 (B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287 

GGTCTGAGGA CCGTTACCAG GAGATCATCA AGGAGACCTC CAACTTCATC 50 

10 AAGAAGGTCG GCTACAACCC CAAGCACGTT CCCTTCGTCC CCATCTCCGG 100 

TTTCAACGGA GACAACATGA TCGAGGCCTC CAGCAACTGC CCCTGGTACA 150 

AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACTGGTAA GACCCTCCTT 200 

GAGGCCATTG ACGCCATCGA CCCTCCCAGC CGTCCTACCG ACAAGCCCCT 250 

CCGTCTTCCC CTCCAGGATG TCTACAAGAT TGGTGGTATT GGCACGGTTC 300 

15 CCGTCGGTCG TGTCGAGACC GGTATCATCA AGGCCGGTAT GGTCGTCACC 350 

TTCGCCCCCG CTGGTGTCAC CACTGAAGTC AAGTCCGTCG AGATGCACCA 4 00 

CGAGCAGCTT ACCGAGGGTG TCCCCGGTGA CAACGTCGGC TTCAACGTCA 4 50 

AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GTAACGTTGC CGGTGACTCC 500 

AAGAACGACC CCCCCAAGGG CTGCGAGTCT TTCAACGCTC AGGTCATTGT 550 

2 0 CCTCAACCAC CCTGGTCAGG TCGGTGCCGG TTACGCGCCA GTCCTCGACT 600 

GCCACACCGC CCACATTGCC TGCAAGTTCT CTGAGCTCCT CGAGAAGATT 650 

GACCGCCGTA CCGGAAAGTC TGTCGAAGCC TCTCCCAAGT TCATCAAGTC 700 

TGGTGACGCG GCCATCGTCA AGATGGTTCC CT 732 

25 

2) INFORMATION FOR SEQ ID NO: 1288 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) ' LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thvringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288 



40 



AATGGATCCT GTATACGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

45 CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARRARCAAT WGCAATCTTT ATTAACCAAA TTCGWGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCCA 337 

50 



2) INFORMATION FOR SEQ ID NO: 1289 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 1289 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARAARCAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCC 336 



2) INFORMATION FOR SEQ ID NO: 1290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1290 
GAYTAYGCIA TGISIGTIAT HGT 



2) INFORMATION FOR SEQ ID NO: 1291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291 
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2) INFORMATION FOR SEQ ID NO: 1292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1292 
ARISCYTCIA RIATRTGIGC 



2) INFORMATION FOR SEQ ID NO: 1293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293 
ATGGCTGAAT TACCTCAATC 



2) INFORMATION FOR SEQ ID NO: 1294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1294 
ATGATTGTTG TATATCTTCT TCAAC 



2) INFORMATION FOR SEQ ID NO: 1295 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1295 
CAGAAAGTTT GAAGCGTTGT 



2) INFORMATION FOR SEQ ID NO: 1296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296 
AACGATTCGT GAGTCAGATA 



2) INFORMATION FOR SEQ ID NO: 1297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297 
CGGTCAACAT TGAGGAAGAG CT 



2) INFORMATION FOR SEQ ID NO: 1298 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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10 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1298 
ACGAAATCGA CCGTCTCTTT TTC 23 

2) INFORMATION FOR SEQ ID NO: 1299 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2711 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: 601055 

(C) ACCESSION NUMBER: X71437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1299 



ATGGCTGAAT TACCTCAATC AAGAATAAAT GAACGAAATA TTACCAGTGA 50 

AATGCGTGAA TCATTTTTAG ATTATGCGAT GAGTGTTATC GTTGCTCGTG 100 

30 CATTGCCAGA TGTTCGTGAC GGTTTAAAAC CAGTACATCG TCGTATACTA 150 

TATGGATTAA ATGAACAAGG TATGACACCG GATAAATCAT ATAAAAAATC 200 

AGCACGTATC GTTGGTGACG TAATGGGTAA ATATCACCCT CATGGTGACT 250 

CATCTATTTA TGAAGCAATG GTACGTATGG CTCAAGATTT CAGTTATCGT 300 

TATCCGCTTG TTGATGGCCA AGGTAACTTT GGTTCAATGG ATGGAGATGG 350 

35 CGCAGCAGCA ATGCGTTATA CTGAAGCGCG TATGACTAAA ATCACACTTG 400 

AACTGTTACG TGATATTAAT AAAGATACAA TAGATTTTAT CGATAACTAT 4 50 

GATGGTAATG AAAGAGAGCC GTCAGTCTTA CCTGCTCGAT TCCCTAATTT 500 

ATTAGCCAAT GGTGCATCAG GTATCGCGGT AGGTATGGCA ACGAATATTC 550 

CACCACATAA CTTAACAGAA TTAATCAATG GTGTACTTAG CTTAAGTAAG 600 

4 0 AACCCTGATA TTTCAATTGC TGAGTTAATG GAAGATATTG AAGGTCCTGA 650 

TTTCCCAACT GCTGGACTTA TTTTAGGTAA GAGTGGTATT AGACGTGCAT 700 

ATGAAACAGG TCGTGGTTCA ATTCAAATGC GTTCTCGTGC AGTTATTGAA 750 

GAACGTGGAG GCGGACGTCA ACGTATTGTT GTCACTGAAA TTCCTTTCCA 800 

AGTGAATAAG GCTCGTATGA TTGAAAAAAT TGCAGAGCTC GTTCGTGACA 850 

45 AGAAAATTGA CGGTATCACT GATTTACGTG ATGAAACAAG TTTACGTACT 900 

GGTGTGCGTG TCGTTATTGA TGTGCGTAAG GATGCAAATG CTAGTGTCAT 950 

TTTAAATAAC TTATACAAAC AAACACCTCT TCAAACATCA TTTGGTGTGA 1000 

ATATGATTGC ACTTGTAAAT GGTAGACCGA AGCTTATTAA TTTAAAAGAA 1050 

GCGTTGGTAC ATTATTTAGA GCATCAAAAG ACAGTTGTTA GAAGACGTAC 1100 

50 GCAATATAAC TTACGTAAAG CTAAAGATCG TGCCCATATT TTAGAAGGGT 1150 

TACGTATCGC ACTTGACCAT ATCGATGAAA TTATTTCAAC GATTCGTGAG 1200 

TCAGATACAG ATAAAGTTGC AATGGAAAGC TTGCAACAAC GCTTCAAACT 1250 

TTCTGAAAAA CAAGCTCAAG CTATTTTAGA CATGCGTTTA AGACGTCTAA 1300 
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CAGGTTTAGA GAGAAACAAA ATTGAAGCTG AATATAATGA GTTATTAAAT 1350 

TATATTAGTG AATTAGAAGC CATCTTAGCT GATGAAGAAG TGTTATTACA 1400 

GTTAGTTAGA GATGAATTGA CTGAAATTAG AGATCGTTTC GGTGATGAGC 14 50 

GTCGTACAGA AATTCAATTA GGTGGATTTG AAGACTTAGA GGACGAAGAC 1500 

5 TTAATTCCAG AAGAACAAAT AGTAATTACT TTGAGCCATA ATAACTACAT 1550 

TAAACGTTTG CCGGTATCTA CATATCGTGC TCAAAACCGT GGTGGTCGTG 1600 

GTGTTCAAGG TATGAATACA TTGGAAGAAG ATTTTGTCAG TCAATTGGTA 1650 

ACTTTAAGTA CACATGACCA TGTATTGTTC TTTACTAACA AAGGTCGTGT 1700 

ATACAAACTA AAAGGTTATG AAGTGCCTGA GTTATCAAGA CAGTCTAAAG 1750 

10 GTATTCCTGT AGTGAATGCT ATTGAACTTG GAAATGATGA AGTCATTAGT 1800 

ACAATGATTG CTGTTAAAGA CCTTGAAAGT GAAGACAACT TCTTAGTGTT 1850 

TGCAACTAAA CGTGGTGTTG TTAAACGTTC AGCATTAAGT AACTTCTCAA 1900 

GAATAAATAG AAATGGTAAG ATTGCGATTT CGTTCAGAGA AGATGATGAG 1950 

TTAATTGCAG TTCGTTTAAC AAGTGGTCAA' GAAGATATCT TGATTGGTAC 2000 

15 ATCACATGCA TCATTAATTC GATTCCCTGA ATCAACATTA CGTCCTTTAG 2050 

GCCGTACAGC AACGGGTGTG AAAGGTATTA CACTTCGTGA AGGTGACGAA 2100 

GTTGTAGGGC TTGATGTAGC TCATGCAAAC AGTGTTGATG AAGTATTAGT 2150 

AGTTACTGAA AATGGTTATG GTAAACGTAC GCCAGTTAAT GACTATCGTT 2200 

TATCAAATCG TGGTGGTAAA GGTATTAAAA CAGCTACGAT TACTGAGCGT 2250 

2 0 AATGGTAATG TTGTATGTAT CACTACAGTA ACTGGTGAAG AAGATTTAAT 2300 

GATTGTTACT AATGCCGGTG TCATTATTCG ACTAGATGTT GCAGATATTT 2 350 

CTCAAAATGG TCGTGCAGCA CAAGGTGTTC GCTTAATTCG CTTAGGCGAT 2 4 00 

GATCAATTTG TTTCAACGGT TGCTAAAGTA AAAGAGGATG CAGATGAAGT 2 450 

AAATGAAGAT GAACAATCTA CTGTATCTGA AGATGGTACT GAACAACAAC 2500 

25 GTGAAGCGGT TGTAAATGAT GAAACACCAG GAAATGCAAT TCATACTGAA 2550 

GTGATTGATT CAGAAGAAAA TGATGAAGAT GGACGTATTG AAGTAAGACA 2 600 

AGATTTCATG GATCGTGTTG AAGAAGATAT ACAACAATCA TCAGATGAAG 2650 

ATGAAGAATA A 2711 



30 



2) INFORMATION FOR SEQ ID NO: 1300 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2628 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

45 (C) ACCESSION NUMBER: X57174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300 

ATGAGCGACC TTGCGAGAGA AATTACACCG GTCAACATTG AGGAAGAGCT 50 

50 GAAGAGCTCC TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 100 

TGCCAGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 150 

GCCATGAACG TACTAGGCAA TGACTGGAAC AAAGCCTATA AAAAATCTGC 200 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCATCCCCAT GGTGACTCGG 250 
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CGGTCTATGA CACGATCGTC CGCATGGCGC AGCCATTCTC GCTGCGTTAT 300 

ATGCTGGTAG ACGGTCAGGG TAACTTCGGT TCTATCGACG GCGACTCTGC 350 

GGCGGCAATG CGTTATACGG AAATCCGTCT GGCGAAAATT GCCCATGAAC 4 00 

TGATGGCCGA TCTCGAAAAA GAGACGGTCG ATTTCGTTGA TAACTATGAC 4 50 

5 GGCACGGAAA AAATTCCGGA CGTCATGCCA ACCAAAATTC -CTAACCTGCT 500 

GGTGAACGGT TCTTCCGGTA TCGCCGTAGG TATGGCAACC AACATCCCGC 550 

CGCACAACCT GACGGAAGTC ATCAACGGTT GTCTGGCGTA TATTGATGAT 600 

GAAGACATCA GCATTGAAGG GCTGATGGAA CACATCCCGG AGCCGGACTT 650 

CCCGACGGCG GCAATCATTA ACGGTCGTCG CGGTATTGAA GAAGCTTACC 700 

10 GTACCGGTCG CGGCAAGGTG TATATCCGCG CTCGCGCAGA AGTGGAAGTT 750 

GACGCCAAAA CCGGTCGTGA AACCATTATC GTCCACGAAA TTCCGTATCA 800 

GGTAAACAAA GCGCGCCTGA TCGAGAAGAT TGCGGAACTG GTAAAAGAAA 850 

AACGCGTGGA AGGCATCAGC GCGCTGCGTG ACGAGTCTGA CAAAGACGGT 900 

ATGCGCATCG TGATTGAAGT GAAACGCGAT GCGGTCGGTG AAGTTGTGCT 950 

15 CAACAACCTC TACTCCCAGA CCCAGTTGCA GGTTTCTTTC GGTATCAACA 1000 

TGGTGGCATT GCACCATGGT. CAGCCGAAGA TCATGAACCT GAAAGACATC 1050 

ATCGCGGCGT TTGTTCGTCA CCGCCGTGAA GTGGTGACCC GTCGTACTAT 1100 

TTTCGAACTG CGTAAAGCTC GCGATCGTGC TCATATCCTT GAAGCATTAG 1150 

CCGTGGCGCT GGCGAACATC GACCCGATCA TCGAACTGAT CCGTCATGCG 1200 

2 0 CCGACGCCTG CAGAAGCGAA AACTGCGCTG GTTGCTAATC CGTGGCAGCT 1250 

GGGCAACGTT GCCGCGATGC TCGAACGTGC TGGCGACGAT GCTGCGCGTC 1300 

CGGAATGGCT GGAGCCAGAG TTCGGCGTGC GTGATGGTCT GTACTACCTG 1350 

ACCGAACAGC AAGCTCAGGC GATTCTGGAT CTGCGTTTGC AGAAACTGAC 14 00 

CGGTCTTGAG CACGAAAAAC TGCTCGACGA ATACAAAGAG CTGCTGGATC 1450 

25 AGATCGCGGA ACTGTTGCGT ATTCTTGGTA GCGCCGATCG TCTGATGGAA 1500 

GTGATCCGTG AAGAGCTGGA GCTGGTTCGT GAACAGTTCG GTGACAAACG 1550 

TCGTACTGAA ATCACCGCCA ACAGCGCAGA CATCAACCTG GAAGATCTGA 1600 

TCACCCAGGA AGATGTGGTC GTGACGCTCT CTCACCAGGG CTACGTTAAG 1650 

TATCAGCCGC TTTCTGAATA CGAAGCGCAG CGTCGTGGCG GGAAAGGTAA 1700 

30 ATCTGCCGCA CGTATTAAAG AAGAAGACTT TATCGACCGA CTGCTGGTGG 1750 

CGAACACTCA CGACCATATT CTGTGCTTCT CCAGCCGTGG TCGCGTCTAT 1800 

TCGATGAAAG TTTATCAGTT GCCGGAAGCC ACTCGTGGCG CGCGCGGTCG 1850 

TCCGATCGTC AACCTGCTGC CGCTGGAGCA GGACGAACGT ATCACTGCGA 1900 

TCCTGCCAGT GACCGAGTTT GAAGAAGGCG TGAAAGTCTT CATGGCGACC 1950 

35 GCTAACGGTA CCGTGAAGAA AACTGTCCTC ACCGAGTTCA ACCGTCTGCG 2000 

TACCGCCGGT AAAGTGGCGA TCAAACTGGT TGACGGCGAT GAGCTGATCG 2050 

GCGTTGACCT GACCAGCGGC GAAGACGAAG TAATGCTGTT CTCCGCTGAA 2100 

GGTAAAGTGG TGCGCTTTAA AGAGTCTTCT GTCCGTGCGA TGGGCTGCAA 2150 

CACCACCGGT GTTCGCGGTA TTCGCTTAGG TGAAGGCGAT AAAGTCGTCT 2200 

4 0 CTCTGATCGT GCCTCGTGGC GATGGCGCAA TCCTCACCGC AACGCAAAAC 2250 

GGTTACGGTA AACGTACCGC AGTGGCGGAA TACCCAACCA AGTCGCGTGC 2300 

GACGAAAGGG GTTATCTCCA TCAAGGTTAC CGAACGTAAC GGTTTAGTTG 2350 

TTGGCGCGGT ACAGGTAGAT GACTGCGACC AGATCATGAT GATCACCGAT 2400 

GCCGGTACGC TGGTACGTAC TCGCGTTTCG GAAATCAGCA TCGTGGGCCG 2450 

45 TAACACCCAG GGCGTGATCC TCATCCGTAC TGCGGAAGAT GAAAACGTAG 2500 

TGGGTCTGCA ACGTGTTGCT GAACCGGTTG ACGAGGAAGA TCTGGATACC 2550 

ATCGACGGCA GTGCCGCGGA AGGGGACGAT GAAATCGCTC CGGAAGTGGA 2 600 

CGTTGACGAC GAGCCAGAAG AAGAATAA 2628 

50 

2) INFORMATION FOR SEQ ID NO: 1301 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301 
GTIMGIAWIM GICCIGSIAT GTA 



2) INFORMATION FOR SEQ ID NO: 1302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302 
TAIADIGGIG GIKKIGCIAT RTA 



2) INFORMATION FOR SEQ ID NO: 1303 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303 
GGIGAIGAID YIMGIGARGG 



2) INFORMATION FOR SEQ ID NO: 1304 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

707 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



5 



25 



35 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1304 
CIARYTTIKY ITTIGTYTG 19 



10 2) INFORMATION FOR SEQ ID NO: 1305 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases ' 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: , Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1305 

ATGGTGACTG CATTGTCAGA TG 22 



2) INFORMATION FOR SEQ ID NO: 1306 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306 
GTCTACGGTT TTCTACAACG TC 22 

2) INFORMATION FOR SEQ ID NO: 1307 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307 

5 ATGGTGACTG CATTGTCAGA TGTAAACAAC ACGGATAATT ATGGTGCTGG 50 

GCAAATACAA GTATTAGAAG GTTTAGAAGC AGTACGTAAA AGACCAGGTA 100 

TGTATATAGG ATCGACTCAG AGAGAGTTGC ACATTAGTGT GGAAATTGTC 150 

GATAATAGTA TCGATGAAGC ATTAGCTGGT TATGCAAATA AAATTGAAGT 200 

TGTTATTGAA AAAGATAACT GGATTAAAGT AACGGATAAC GGACGTGGTA 250 

10 TCCCAGTTGA TATTCAAGAA AAAATGGGAC GTCCAGCTGT CGAAGTTATT 300 

TTAACTGTTT TACATGCTGG TGGTAAATTC GGCGGTGGCG GATACAAAGT 350 

ATCTGGTGGT TTACATGGTG TTGGTTCATC AGTTGTAAAC GCATTGTCAC 4 00 

AAGACTTAGA AGTATATGTA CACAGAAATG AGACTATATA TCATCAAGCA 4 50 

TATAAAAAAG GTGTACCTCA ATTTGACTTA' AAAGAAGTTG GCACAACTGA 500 

15 TAAGACAGGT ACTGTCATTC GTTTTAAAGC AGATGGAGAA ATCTTCACAG 550 

AGACAACTGT ATACAACTAT GAAACATTAC AGCAGCGTAT TAGAGAGCTT 600 

GCTTTCTTAA ACAAAGGAAT TCAAATCACA TTAAGAGATG AACGTGATGA 650 

AGAAAACGTT AGAGAAGACT CCTATCACTA TGAGGGCGGT ATTAAATCGT 700 

ACGTTGAGTT ATTGAACGAA AATAAAGAAC CTATTCATGA TGAGCCAATT 750 

20 TATATTCATC AATCTAAAGA TGATATTGAA GTAGAAATTG CGATTCAATA 800 

TAACTCAGGA TATGCCACAA ATCTTTTAAC TTACGCAAAT AACATTCATA 850 

CGTACGAAGG TGGTACGCAT GAAGACGGAT TCAAACGTGC ATTAACGCGT 900 

GTCTTAAATA GTTATGGTTT AAGTAGCAGA TATGAAGAAG AAAAGATAGC 950 

TTCTGGTGAA GATACACGAG AAGGTATGAC AGCAATTATA TCTATCAAAC 1000 

25 ATGGTGATCC TCAATTCGAA GGTCAAACGA AGACAAAATT AGGTAATTCT 1050 

GAAGTGCGTC AAGTTGTAGA TAAATTATTC TCAGAGCACT TTGAACGATT 1100 

TTTATATGAA AATCCACAAG TCGCACGTAC AGTGGTTGAA AAAGGTATTA 1150 

TGGCGGCACG TGCACGTGTT GCTGCGAAAA AAGCGCGTGA AGTAACACGT 1200 

CGTAAATCAG CGTTAGATGT AGCAAGTCTT CCAGGTAAAT TAGCCGATTG 1250 

30 CTCTAGTCAA AGTCCTGAAG AATGTGAGAT TTTCTTAGTC GAAGGGGACT 1300 

CTGCCGGAGG GTCTACAAAA TCTGGTCGTG ACTCTAGAAC GCAGGCGATT 1350 

TTACCATTAC GAGGTAAGAT ATTAAATGTT GAAAAAGCAC GATTAGATAG 14 00 

AATTTTGAAT AACAATGAAA TTCGTCAAAT GATCACAGCA TTTGGTACAG 14 50 

GAATCGGTGG CGACTTTGAT CTAGCGAAAG CAAGATATCA CAAAATCGTC 1500 

35 ATTATGACTG ATGCCGATGT GGATGGAGCG CATATTAGAA CATTGTTATT 1550 

AATATTCTTC TATCGATTTA TGAGACCGTT AATTGAAGCA GGCTATGTGT 1600 

ATATTGCACA GCCACCGTTG TATAAACTGA CACAAGGTAA ACAAAAGTAT 1650 

TATGTATACA ATGATAGGGA ACTTGATAAA CTTAAATCTG AATTGAATCC 1700 

AACACCAAAA TGGTCTATTG CGCTATACAA AGGTCTTGGA GAAATGAATG 1750 

4 0 CAGATCAATT ATGGGAAACA ACAATGAACC CTGAGCACCG CGCTCTTTTA 1800 

CAAGTAAAAC TTGAAGATGC GATTGAAGCG GACCAAACAT TTGAAATGTT 1850 

AATGGGTGAC GTTGTAGAAA ACCGTAGACA ATTTATAGAA GATAATGCAG 1900 

TTTATGCAAA CTTAGACTTC TAA 1923 

45 

2) INFORMATION FOR SEQ ID NO: 1308 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNSSS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308 
ATGTAYGTIA TIATGGAYMG IGC 



2) INFORMATION FOR SEQ ID NO: 1309 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY:. Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309 
ATIATYTTRT TICCYTTICC YTT 



2) INFORMATION FOR SEQ ID NO: 1310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310 
ATIATITSIA TIACYTCRTC 



2) INFORMATION FOR SEQ ID NO: 1311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311 
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2) INFORMATION FOR SEQ ID NO: 1312 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312 
AARTAYATIA TICARGARMG IGC 



2) INFORMATION FOR SEQ ID NO: 1313 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313 
AMIAYICKRT GIGGITTITT YTT 



2) INFORMATION FOR SEQ ID NO: 1314 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314 
TAIGAITTYA CIGAISMICA RGC 



2) INFORMATION FOR SEQ ID NO: 1315 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315 
ACIATIGCIT CIGCYTGIKS YTC 



2) INFORMATION FOR SEQ ID NO: 1316 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316 
GTGAGTGAAA TAATTCAAGA TT 



2) INFORMATION FOR SEQ ID NO: 1317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317 
CACCAAAATC ATCTGTATCT AC 



2) INFORMATION FOR SEQ ID NO: 1318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318 
ACCTAYTCSA TGTACGTRAT CATGGA 



2) INFORMATION FOR SEQ ID NO: 1319 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319 
AGRTCGTCIA CCATCGGYAG YTT 



2) INFORMATION FOR SEQ ID NO: 1320 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K-12 MG1655 

(C) ACCESSION NUMBER: AE000384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320 

ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC 50 

CTACTTAAAC TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA 100 

TTGGTGATGG TCTGAAACCT GTTCAGCGCC GCATTGTGTA TGCGATGTCT 150 

GAACTGGGCC TGAATGCCAG CGCCAAATTT AAAAAATCGG CCCGTACCGT 200 

CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC GCCTGTTATG 250 

AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT 300 

GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC 350 

AATGCGTTAC ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA 400 

GCGAGCTGGG GCAGGGGACG GCTGACTGGG TGCCAAACTT CGACGGCACT 450 

TTGCAGGAGC CGAAAATGCT ACCTGCCCGT CTGCCAAACA TTTTGCTTAA 500 
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CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT CCACCGCATA 550 

ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA 600 

ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC 650 

TGAAGCGGAA ATTATCACTT CGCGCGCCGA GATCCGTAAA ATCTACGAGA 700 

5 ACGGACGTGG TTCAGTGCGT ATGCGCGCGG TGTGGAAGAA AGAAGATGGC 750 

GCGGTGGTTA TCAGCGCATT GCCGCATCAG GTTTCAGGTG CGCGCGTACT 800 

GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG ATGGTTGACG 850 

ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGCCT GGTGATTGTG 900 

CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC 950 

10 TACCACCGAT CTGGAAAAGA GCTATCGTAT TAACCTTAAT ATGATCGGTC 1000 

TGGATGGTCG TCCGGC.GGTG AAAAACCTGC TGGAAATCCT CTCCGAATGG 1050 

CTGGTGTTCC GCCGCGATAC CGTGCGCCGC CGACTGAACT ATCGTCTGGA 1100 

GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG GTGGCGTTTC 1150 

TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA 1200 

15 CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT 1250 

CCTCGAACTG AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC 1300 

GCGGTGAGCA GAGTGAACTG GAAAAAGAGC GCGACCAGTT GCAGGGCATT 1350 

TTGGCTTCCG AGCGTAAAAT GAATAACCTG CTGAAGAAAG AACTGCAGGC 1400 

AGACGCGCAA GCCTACGGTG ACGATCGTCG TTCGCCGTTG CAGGAACGCG 14 50 

20 AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC 1500 

ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA 1550 

TATCGACGCG CCGGGCCTGA ATTATAAAGC GGGTGATAGC TTCAAAGCGG 1600 

CGGTGAAAGG TAAGAGCAAC CAACCGGTAG TGTTTGTTGA TTCCACCGGT 1650 

CGTAGCTATG CCATTGACCC GATTACGCTG CCGTCGGCGC GTGGTCAGGG 1700 

25 CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGGGCG ACCGTTGACC 1750 

ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG 1800 

GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC 1850 

AGGTAAGGCT TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG 1900 

TGGTGATTGA AGATGCTTCC GATATGCTGC TGGCAATCAC TCAGGCAGGC 1950 

30 CGTATGTTGA TGTTCCCGGT AAGTGATCTG CCGCAGCTGT CGAAGGGCAA 2000 

AGGCAACAAG ATTATCAACA TTCCATCGGC AGAAGCCGCG CGTGGAGAAG 2050 

ATGGTCTGGC GCAATTGTAC GTTCTGCCGC CGCAAAGCAC GCTGACCATT 2100 

CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAAGAGT TACAGAAAGT 2150 

CACTGGCGAA CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA 2200 

3 5 TCGATCGTGT TGAGATCGAC TCTCCTCGCC GTGCCAGCAG CGGTGATAGC 2250 

GAAGAGTAA 2259 



40 2) INFORMATION FOR SEQ ID NO: 1321 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2403 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: KMP9 

(C) ACCESSION NUMBER: D67074 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321 

GTGAGTGAAA TAATTCAAGA TTTATCACTT GAAGATGTTT TAGGTGATCG 50 

5 CTTTGGAAGA TATAGTAAAT ATATTATTCA AGAGCGTGCA TTGCCAGATG 100 

TTCGTGATGG TTTAAAACCA GTACAACGTC GTATTTTATA TGCAATGTAT 150 

TCAAGTGGTA ATACACACGA TAAAAATTTC CGTAAAAGTG CGAAAACAGT 200 

CGGTGATGTT ATTGGTCAAT ATCATCCACA TGGAGACTTC TCAGTGTACA 250 

AAGCAATGGT CCGTTTAAGT CAAGACTGGA AGTTACGACA TGTCTTAATA 300 

10 GAAATGCATG GTAATAATGG TAGTATCGAT AATGATCCGC CAGCGGCAAT 350 

GCGTTACACT GAAGCTAAGT TAAGCTTACT AGCTGAAGAG TTATTACGTG 400 

ATATTAATAA AGAGACAGTT TCTTTCATTC CAAACTATGA TGATACGACA 450 

CTCGAACCAA TGGTATTGCC ATCAAGATTT CCTAACTTAC TAGTGAATGG 500 

TTCTACAGGT ATATCTGCAG GTTACGCGAC AGATATACCA CCACATAATT 550 

15 TAGCTGAAGT GATTCAAGCA ACACTTAAAT ATATTGATAA TCCGGATATT 600 

ACAGTCAATC AATTAATGAA ATATATTAAA GGTCCTGATT TTCCAACTGG 650 

TGGTATTATT CAAGGTATTG ATGGTATTAA AAAAGCTTAT GAATCAGGTA 700 

AAGGTAGAAT TATAGTTCGT TCTAAAGTTG AAGAAGAAAC TTTACGCAAT 750 

GGACGTAAAC AGTTAATTAT TACTGAAATT CCATATGAAG TGAACAAAAG 800 

20 TAGCTTAGTA AAACGTATCG ATGAATTACG TGCTGACAAA AAAGTCGATG 850 

GTATCGTTGA AGTACGTGAT GAAACTGATA GAACTGGTTT ACGAATAGCA 900 

ATTGAATTGA AAAAAGATGT GAACAGTGAA TCAATCAAAA ATTATCTTTA 950 

TAAAAACTCT GATTTACAGA TTTCATATAA TTTCAACATG GTCGCTATTA 1000 

GTGATGGTCG TCCAAAATTG ATGGGTATTC GTCAAATTAT AGATAGTTAT 1050 

25 TTGAATCATC AAATTGAGGT TGTTGCAAAT AGAACGAAGT TTGAATTAGA 1100 

TAATGCTGAA AAACGTATGC ATATCGTTGA AGGTTTGATT AAAGCGTTGT 1150 

CAATTTTAGA TAAAGTAATT GAATTGATTC GTAGCTCTAA AAACAAGCGT 1200 

GACGCTAAAG AAAACCTTAT CGAAGTATTC GAGTTCACAG AAGAACAGGC 1250 

TGAAGCAATT GTAATGTTAC AGTTATATCG TTTAACAAAC ACTGACATAG 1300 

30 TTGCGCTTGA AGGTGAACAT AAAGAACTTG AAGCATTAAT CAAACAATTA 1350 

CGTCATATTC TTGATAACCA TGATGCATTA TTGAATGTCA TAAAAGAAGA 1400 

ATTGAATGAA ATTAAAAAGA AATTCAAATC TGAACGACTG TCTTTAATTG 14 50 

AAGCAGAAAT TGAAGAAATT AAAATTGACA AAGAAGTTAT GGTGCCTAGT 1500 

GAAGAAGTTA TTTTAAGTAT GACACGTCAT GGATATATTA AACGTACTTC 1550 

35 TATTCGTAGC TTTAATGCTA GCGGTGTTGA GGATATTGGT TTAAAAGATG 1600 

GTGACAGTTT ACTTAAACAT CAAGAAGTAA ATACGCAAGA TACCGTACTA 1650 

GTATTTACAA ATAAAGGTCG TTATCTATTT ATACCGGTTC ATAAATTAGC 1700 

AGATATTCGT TGGAAAGAAT TGGGGCAACA TGTATCACAA ATAGTTCCTA 1750 

TCGAAGAAGA TGAAGTGGTT ATTAATGTCT TTAATGAAAA GGACTTTAAT 1800 

40 ACAGATGCAT TTTATGTTTT TGCGACTCAA AATGGCATGA TTAAGAAAAG 1850 

TACAGTGCCT CTATTTAAAA CAACGCGTTT TAATAAACCT TTAATTGCTA 1900 

CTAAAGTTAA AGAAAATGAT GATTTGATTA GTGTTATGCG CTTTGAAAAA 1950 

GATCAATTAA TTACCGTCAT TACTAATAAA GGTATGTCAT TAACGTATAA 2000 

TACAAGTGAA CTATCAGATA CCGGATTAAG GGCAGCTGGT GTTAAATCAA 2050 

45 TAAATCTTAA AGCTGAAGAT TTCGTTGTTA TGACAGAAGG TGTTTCTGAA 2100 

AATGATACTA TATTGATGGC CACACAACGC GGCTCGTTAA AACGTATTAG 2150 

TTTTAAAATC TTACAAGTTG CTAAAAGAGC ACAACGTGGA ATAACTTTAT 2200 

TAAAAGAATT AAAGAAAAAT CCACATCGTA TTGTAGCTGC ACATGTAGTG 2250 

ACAGGTGAAC ATAGTCAATA TACATTATAT TCAAAATCAA ATGAAGAACA 2300 

50 TGGTTTAATT AATGATATTC ATAAATCTGA ACAATATACA AATGGCTCAT 2350 

TCATTGTAGA TACAGATGAT TTTGGTGAAG TAATAGACAT GTATATTAGC 2400 

TAA 2403 
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2) INFORMATION FOR SEQ ID NO: 1322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322 
RTIGAIAAYI SIGTIGAYGA RG 22 



2) INFORMATION FOR SEQ ID NO: 1323 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323 
ACIAWRSAIG GIGGIACICA YG 22 

30 

2) INFORMATION FOR SEQ ID NO: 1324 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324 
45 CCICCIGCIS WRTCICCYTC 20 



2) INFORMATION FOR SEQ ID NO: 1325 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

716 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325 
RTTCATYTCI CCIARICCYT T 



2) INFORMATION FOR SEQ ID NO: 1326 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326 
TGATTCAATA CAGGTTTTAG AG 



2) INFORMATION FOR SEQ ID NO: 1327 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327 
CTAGATTTCC TCCTCATCAA AT 



2) INFORMATION FOR SEQ ID NO: 1328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1992 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN4220 

(C) ACCESSION NUMBER: D67075 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328 

ATGAATAAAC AAAATAATTA TTCAGATGAT TCAATACAGG TTTTAGAGGG 50 

GTTAGAAGCA GTTCGTAAAA GACCTGGTAT GTATATTGGA TCAACTGATA 100 

10 AACGGGGATT ACATCATCTA GTATATGAAA TTGTCGATAA CTCCGTCGAT 150 

GAAGTATTGA ATGGTTACGG TAACGAAATA GATGTAACAA TTAATAAAGA 200 

TGGTAGTATT TCTATAGAAG ATAATGGACG TGGTATGCCA ACAGGTATAC 250 

ATAAATCAGG TAAACCGACA GTCGAAGTTA TCTTTACTGT TTTACATGCA 300 

GGAGGTAAAT TTGGACAAGG CGGCTATAAA ACTTCAGGTG GTCTTCACGG 350 

15 TGTTGGTGCT TCAGTTGTAA ATGCATTGAG TGAATGGCTT GAAGTTGAAA 400 

TCCATCGAGA TGGTAATATA TATCATCAAA GTTTTAAAAA CGGTGGTTCG 4 50 

CCATCTTCTG GTTTAGTGAA AAAAGGTAAA ACTAAGAAAA CAGGTACCAA 500 

AGTAACATTT AAACCTGATG ACACAATTTT TAAAGCATCT ACATCATTTA 550 

ATTTTGATGT TTTAAGTGAA CGACTACAAG AGTCTGCGTT CTTATTGAAA 600 

20 AATTTAAAAA TAACGCTTAA TGATTTACGC AGTGGTAAAG AGCGTCAAGA 650 

GCATTACCAT TATGAAGAAG GAATCAAAGA GTTTGTTAGT TATGTCAATG 700 

AAGGAAAAGA AGTTTTGCAT GACGTGGCTA CATTTTCAGG TGAAGCAAAT 750 

GGTATAGAGG TAGACGTAGC TTTCCAATAT AATGATCAAT ATTCAGAAAG 800 

TATTTTAAGT TTTGTAAATA ATGTACGTAC TAAAGATGGT GGTACACATG 850 

25 AAGTTGGTTT TAAAACAGCA ATGACACGTG TATTTAATGA TTATGCACGT 900 

CGTATTAATG AACTTAAAAC AAAAGATAAA AACTTAGATG GTAATGATAT 950 

TCGTGAAGGT TTAACAGCTG TTGTGTCTGT TCGTATTCCA GAAGAATTAT 1000 

TGCAATTTGA AGGACAAACG AAATCTAAAT TGGGTACTTC TGAAGCTAGA 1050 

AGTGCTGTTG ATTCAGTTGT TGCAGACAAA TTGCCATTCT ATTTAGAAGA 1100 

3 0 AAAAGGACAA TTGTCTAAAT CACTTGTGAA AAAAGCGATT AAAGCACAAC 1150 

AAGCAAGGGA AGCTGCACGT AAAGCTCGTG AAGATGCTCG TTCAGGTAAG 1200 

AAAAACAAGC GTAAAGACAC TTTGCTATCT GGTAAATTAA CACCTGCACA 1250 

AAGTAAAAAC ACTGAAAAAA ATGAATTGTA TTTAGTCGAA GGTGATTCTG 1300 

CGGGAGGTTC AGCAAAACTT GGACGAGACC GCAAATTCCA AGCGATATTA 1350 

35. CCATTACGTG GTAAGGTAAT TAATACAGAG AAAGCACGTC TAGAAGATAT 1400 

TTTTAAAAAT GAAGAAATTA ATACAATTAT CCACACAATC GGGGCAGGCG 14 50 

TTGGTACTGA CTTTAAAATT GAAGATAGTA ATTATAATCG TGTAATTATT 1500 

ATGACTGATG CTGATACTGA TGGTGCGCAT ATTCAAGTGC TATTGTTAAC 1550 

ATTCTTCTTC AAATATATGA AACCGCTTGT TCAAGCAGGT CGTGTATTTA 1600 

4 0 TTGCTTTACC TCCACTTTAT AAATTGGAAA AAGGTAAAGG CAAAACAAAG 1650 

CGAGTTGAAT ACGCTTGGAC AGACGAAGAG CTTAATAAAT TGCAAAAAGA 1700 

ACTTGGTAAA GGCTTCACGT TACAACGTTA CAAAGGTTTG GGTGAAATGA 1750 

ACCCTGAGCA ATTATGGGAA ACGACGATGA ACCCAGAAAC ACGAACTTTA 1800 

ATTCGTGTAC AAGTTGAAGA TGAAGTGCGT TCATCTAAAC GTGTAACAAC 1850 

4 5 ATTAATGGGT GACAAAGTAC AACCTAGACG TGAATGGATT GAAAAGCATG 1900 

TTGAGTTTGG TATGCAAGAG GACCAAAGTA TTTTAGATAA TTCTGAAGTA 1950 

CAAGTGCTTG AAAATGATCA ATTTGATGAG GAGGAAATCT AG 1992 



50 



2) INFORMATION FOR SEQ ID NO: 1329 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329 
TGTAGAGCGC GGTATCATCA AAGTA 



2) INFORMATION FOR SEQ ID NO: 1330 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330 
AGATTCGAAC TTGGTGTGCG GG 



2) INFORMATION FOR SEQ ID NO: 1331 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331 
GCCCTTGAGG TACAGAATGG TAATGAAGTT 



2) INFORMATION FOR SEQ ID NO: 1332 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332 
5 GACCGCGGCG CAGACCATCA 20 



2) INFORMATION FOR SEQ ID NO: 1333 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333 

20 

TCATGGTGAC TTATCTATTT ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1334 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334 

CATCTATTTA TAAAGCAATG GTA 23 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1335 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335 

CTATTTATGG AGCAATGGT 19 
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2 ) INFORMATION FOR SEQ ID NO: 1336 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336 
TGGAGACTAC TCAGTGT 17 



2) INFORMATION FOR SEQ ID NO: 1337 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337 
TGGAGACTTC TCAGTGT 17 

30 

2) INFORMATION FOR SEQ ID NO: 1338 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338 
45 GTGTACGGAG CAATG 15 



2) INFORMATION FOR SEQ ID NO: 1339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339 
CCAGCGGAAA TGCGT 



2) INFORMATION FOR SEQ ID NO: 1340 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340 
GAACAAGGTA TGACACCGGA TAAAT 



2) INFORMATION FOR SEQ ID NO: 1341 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341 
GATAACTGAA ATCCTGAGCC ATACG 



2) INFORMATION FOR SEQ ID NO: 1342 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342 

GATGTTATTG GTCAATATCA TCCA 24 

5 

2) INFORMATION FOR SEQ ID NO: 1343 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear' 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343 
AAGAAACTGT CTCTTTATTA ATATCACGT 29 

20 

2) INFORMATION FOR SEQ ID NO: 1344 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344 
35 AGCAGCAACG ATGTTACGCA GCAG 24 



40 



2) INFORMATION FOR SEQ ID NO: 1345 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345 

50 

CCCGCCGAGC ATTTCAACTA TTG 23 
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2) INFORMATION FOR SEQ ID NO: 1346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346 
GATGTTACGC AGCAGGGCAG TC 22 



2) INFORMATION FOR SEQ ID NO: 1347 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1347 
ACCAAGCAGG TTCGCAGTCA AGTA 24 



2) INFORMATION FOR SEQ ID NO: 1348 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Unidentified bacterium 
(C) ACCESSION NUMBER: X04555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348 

ATGCGCTCAC GCAACTGGTC CAGAACCTTG ACCGAACGCA GCGGTGGTAA 50 

CGGCGCAGTG GCGGTTTTCA TGGCTTGTTA TGACTGTTTT TTTGTACAGT 100 

CTATGCCTCG GGCATCCAAG CAGCAAGCGC GTTACGCCGT GGGTCGATGT 150 

TTGATGTTAT GGAGCAGCAA CGATGTTACG CAGCAGGGCA GTCGCCCTAA 200 

AACAAAGTTA GGCCGCATGG ACACAACGCA GGTCACATTG ATACACAAAA 250 
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TTCTAGCTGC GGCAGATGAG CGAAATCTGC CGCTCTGGAT CGGTGGGGGC 300 

TGGGCGATCG ATGCACGGCT AGGGCGTGTA ACACGCAAGC ACGATGATAT 350 

TGATCTGACG TTTCCCGGCG AGAGGCGCGG CGAGCTCGAG GCAATAGTTG 400 

AAATGCTCGG CGGGCGCGTC ATGGAGGAGT TGGACTATGG ATTCTTAGCG 450 

5 GAGATCGGGG ATGAGTTACT TGACTGCGAA CCTGCTTGGT GGGCAGACGA 500 

AGCGTATGAA ATCGCGGAGG CTCCGCAGGG CTCGTGCCCA GAGGCGGCTG 550 

AGGGCGTCAT CGCCGGGCGG CCAGTCCGTT GTAACAGCTG GGAGGCGATC 600 

ATCTGGGATT ACTTTTACTA TGCCGATGAA GTACCACCAG TGGACTGGCC 650 

TACAAAGCAC ATAGAGTCCT ACAGGCTCGC ATGCACCTCA CTCGGGGCGG 700 

10 AAAAGGTTGA GGTCTTGCGT GCCGCTTTCA GGTCGCGATA TGCGGCCTAA 750 



2) INFORMATION FOR SEQ ID NO: 1349 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349 
25 

CAGCCGACCA ATGAGTATCT TGCC 24 



30 2) INFORMATION FOR SEQ ID NO: 1350 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350 

TAATCAGGGC AGTTGCGACT CCTA 24 



45 

2) INFORMATION FOR SEQ ID NO: 1351 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 531 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: Stone 130 

(C) ACCESSION NUMBER: L06157 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351 

ATGTTATGGA GCAGCAACGA TGTTACGCAG CAGGGCAGTC GCCCTAAAAC 50 

AAAGTTAGGT GGCTCAATGA GCATCATTGC AACCGTCAAG ATCGGCCCTG 100 

ACGAAATTTC AGCCATGAGG GCTGTGCTCG ATCTCTTCGG CAAAGAGTTT 150 

GAGGACATTC CAACCTACTC TGATCGCCAG CCGACCAATG AGTATCTTGC 200 

CAATCTTCTG CACAGCGAGA CGTTCATCGC GCTCGCTGCT TTTGACCGCG 250 

GAACAGCAAT AGGTGGGCTC GCCGCCTACG TTCTACCCAA GTTCGAGCAA 300 

GCGCGAAGCG AGATCTACAT TTATGACTTG GCAGTCGCTT CCAGCCATCG 350 

AAGGCTAGGA GTCGCAACTG CCCTGATTAG CCACCTGAAG CGTGTGGCGG 4 00 

TTGAACTTGG CGCGTATGTA ATCTATGTGC AAGCAGACTA CGGTGACGAT 4 50 

CCGGCAGTCG CTCTCTACAC AAAGCTTGGA GTTCGGGAAG ACGTCATGCA 500 

CTTCGACATT GATCCAAGAA CCGCCACCTA A 531 



2) INFORMATION FOR SEQ ID NO: 1352 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRAW DEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352 
CCACGCTGAC AGAGCCGCAC CG 22 



2) INFORMATION FOR SEQ ID NO: 1353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353 
GGCCAGCTCC CATCGGACCC TG 22 
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10 



15 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354 
CACGCTGACA GAGCCGCACC G 21 

2) INFORMATION FOR SEQ ID NO: 1355 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355 
ATGCCGTTGC TGTCGAAATC CTCG 24 

30 



2) INFORMATION FOR SEQ ID NO: 1356 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
45 (C) ACCESSION NUMBER: M97172 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356 

ATGAACACGA TCGAATCGAT CACGGCGGAC CTGCACGGAC TGGGCGTCCG 50 

50 GCCCGGCGAC CTGATCATGG TCCATGCATC GCTGAAAGCC GTCGGCCCGG 100 

TCGAGGGAGG TGCGGCCTCG GTGGTGTCGG CCCTTCGCGC CGCGGTCGGG 150 

TCCGCAGGGA CCCTGATGGG TTATGCCTCA TGGGACCGCT CGCCCTATGA 200 

GGAGACGCTG AACGGCGCGC GGATGGACGA AGAACTGCGC CGCCGGTGGC 250 
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CACCCTTCGA 
AACCGGTTTC 
CGCCTCCATG 
ACCGGCTTGG 
GGGCATGGCG 
CGTGCTGCAT 
TGACCTATGA 
CTGGCCGAGG 
TGGGAAGCCG 
GCCGGCATCG 
GCGCAGGATA 
CGCGCCCTGA 



TCTGGCCACA 
TGCTTGAGGC 
GTCGCGGTCG 
GCAGGCGCTG 
GAAAGGTCCT 
TACGCCGAGG 
AATGCCGATG 
ATTTCGACAG 
GATGCCGTCG 
GGAAGGCATC 
TCGTCTCGTT 



TCCGGTACCT ATCCCGGCTT CGGCCTGCTC 
GCCCGACGCA CGGCGCAGCG CGCATCCCGA 
GCCCCCTTGC CGCCACGCTG ACAGAGCCGC 
GGCGAAGGCT CGCCGCTGGA GCGCTTCGTC 
GCTTCTGGGA GCGCCGCTCG ACTCCGTCAC 
CCATCGCCCC CATCCCGAAC AAACGCCGCG 
CTCGGCCCGG ATGGCAGGGT CCGATGGGAG 
CAACGGCATT CTCGATTGCT TCGCGGTCGA 
AGACGATCGC CAAGGCTTAT GTCGAACTGG 
GTCGGTCGCG CACCCTCCTA TCTGTTTGAA 
CGGCGTCACC TATCTCGAAC AGCATTTCGG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
810 



2) INFORMATION FOR SEQ ID NO: 1357 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357 
GCCCATCCAT TTGCCTTTGC 2 



2) INFORMATION FOR SEQ ID NO: 1358 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358 
GCGTACCAAC TTGCCATCCT GAAG 2 



2) INFORMATION FOR SEQ ID NO: 1359 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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10 



20 



25 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359 
TGCCCCTGCC ACCTCACTC 19 

2) INFORMATION FOR SEQ ID NO: 1360 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360 
CGTACCAACT TGCCATCCTG AAGA 24 

2) INFORMATION FOR SEQ ID NO: 1361 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X01385 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361 



GTGCAATACG AATGGCGAAA AGCCGAGCTC ATCGGTCAGC TTCTCAACCT 50 

TGGGGTTACC CCCGGCGGTG TGCTGCTGGT CCACAGCTCC TTCCGTAGCG 100 

45 TCCGGCCCCT CGAAGATGGG CCACTTGGAC TGATCGAGGC CCTGCGTGCT 150 

GCGCTGGGTC CGGGAGGGAC GCTCGTCATG CCCTCGTGGT CAGGTCTGGA 200 

CGACGAGCCG TTCGATCCTG CCACGTCGCC CGTTACACCG GACCTTGGAG 250 

TTGTCTCTGA CACATTCTGG CGCCTGCCAA ATGTAAAGCG CAGCGCCCAT 300 

CCATTTGCCT TTGCGGCAGC GGGGCCACAG GCAGAGCAGA TCATCTCTGA 350 

50 TCCATTGCCC CTGCCACCTC ACTCGCCTGC AAGCCCGGTC GCCCGTGTCC 400 

ATGAACTCGA TGGGCAGGTA CTTCTCCTCG GCGTGGGACA CGATGCCAAC 4 50 

ACGACGCTGC ATCTTGCCGA GTTGATGGCA AAGGTTCCCT ATGGGGTGCC 500 

GAGACACTGC ACCATTCTTC AGGATGGCAA GTTGGTACGC GTCGATTATC 550 
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TCGAGAATGA CCACTGCTGT GAGCGCTTTG CCTTGGCGGA CAGGTGGCTC 
AAGGAGAAGA GCCTTCAGAA GGAAGGTCCA GTCGGTCATG CCTTTGCTCG 
GTTGATCCGC TCCCGCGACA TTGTGGCGAC AGCCCTGGGT CAACTGGGCC 
GAGATCCGTT GATCTTCCTG CATCCGCCAG AGGGCGGGAT GCGAAGAATG 
CGATGCCGCT CGCCAGTCGA TTGGCTGAGC TCATGA 



600 
650 
700 
750 
786 



2) INFORMATION FOR SEQ ID NO: 1362 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single' 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362 
CGCCGCCATC GCCCAAAGCT GG .22 



2) INFORMATION FOR SEQ ID NO: 1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1363 
CGGCATAATG GAGCGCGGTG ACTG 24 



2) INFORMATION FOR SEQ ID NO: 1364 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364 
TTTCTCGCCC ACGCAGGAAA AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1365 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365 
CATCCTCGAC GAATATGCCG CG 



2) INFORMATION FOR SEQ ID NO: 1366 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 
(C) ACCESSION NUMBER: M88012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366 

ATGACTGATC CCCGCAAAAA CGGCGATTTG CACGAACCCG CGACGGCACC 50 

CGCGACGCCC TGGTCCAAAA GCGAGCTGGT CCGGCAATTG CGCGACCTCG 100 

GCGTGCGCTC AGGCGATATG GTGATGCCGC ATGTGTCGTT GCGCGCCGTC 150 

GGGCCGCTGG CGGACGGACC GCAGACACTT GTCGATGCGC TGATCGAGGC 200 

CGTCGGCCCC ACCGGGAATA TTCTCGCCTT CGTCTCGTGG CGCGATTCGC 250 

CCTATGAACA GACGCTGGGT CATGATGCGC CGCCCGCCGC CATCGCCCAA 300 

AGCTGGCCTG CGTTCGACCC CGACCATGCG CCCGCCTACC CCGGCTTTGG 350 

CGCGATCAAC GAATTTATCC GAACCTATCC GGGGTGTCGG CGCACGGCCC 400 

ATCCCGACGC ATCGATGGCG GCGATCGGGC CCGATGCGGC GTGGCTGGTG 4 50 

GCGCCGCACG AGATGGGCGC CGCTTATGGC CCCCGCTCGC CGATCGCGCG 500 

TTTTCTCGCC CACGCAGGAA AAATCCTGTC GATCGGCGCC GGGCCCGATG 550 

CAGTCACCGC GCTCCATTAT GCCGAAGCGG TGGCGCGGAT CGAGGGCAAG 600 

CGCCGCGTCA CTTATTCGAT GCCCTTACTG CGCGAAGGCA AGCGCGTCTG 650 

GGTCACCACG TCCGACTGGG ATTCGAACGG CATCCTCGAC GAATATGCCG 700 

CGCCCGACGG CCCCGACGCG GTCGAACGGA TCGCCCGCGA CTATCTCGCC 750 

CGCACCAGGG TTGCGCAAGG CCCGGTCGGC GGCGCGCAAT CCCGGCTGAT 800 

CGACGCGGCC GATATCGTTT CCTTCGGCAT CGAATGGCTC GAGGCGCGCC 850 

ACGCCGCGCC AGCGGCGGCA GCGCTGAAGC CGAAACAACG CCGCGACTGA 900 
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2) INFORMATION FOR SEQ ID NO: 1367 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367 
CAAATATACT AACAGAAGCG TTCA 



2) INFORMATION FOR SEQ ID NO: 1368 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368 
AGGATCTTGC CAATACCTTT AT 



2) INFORMATION FOR SEQ ID NO: 1369 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369 
AAACCTTTGT TTCGGTCTGC TAAT 



2) INFORMATION FOR SEQ ID NO: 1370 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370 
AAGCGATTCC AATAATACCT TGCT 



24 



10 



15 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter diversus 
(C) ACCESSION NUMBER: M18967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371 



ATGAATTATC 
AGCAGCAAAT 
GGCCAGATAT 
CCAAACCTTT 
AGGCTTAAGG 
TCAGACCAGA 
TTAGAAAACA 
AGATGATGAA 
ATAATATATT 
TATGAGTTTT 
TGCCAATGGT 
AAGAGTAA 



AAATTGTGAA 
ATACTAACAG 
GACGAGTGCA 
GTTTCGGTCT 
CCAATGTACA 
TTATCAAAAT 
GAGCTAGAGA 
TACTATAGAA 
TGATTCAATA 
ATCAGAAGAA 
AAAAACAAAC 



TATTGCGGAA 
AAGCGTTCAA 
ACAAAAGAAG 
GCTAATAAAT 
AGGAAACCTG 
AAAGGTATTG 
GCAAGGTATT 
CAAGTCTCTC 
AAAAATATTA 
TGGTTATTAT 
CAGATATTTG 



TGCAGCAATT 
TGATCTTGGT 
TAAAAGAATG 
AACTCCTTAG 
GGAATTGCAT 
GCAAGATCCT 
ATTGGAATCG 
TTTAATAACT 
AAAATATTAA 
ATTGTTGGAA 
GATGTGGAAA 



ATCAGTTAGA 50 

AACAATTCAT 100 

TATTGAGAGT 150 

TTGGCTGGAT 200 

CCATTGGTTG 250 

GCTTAAGGAA 300 

CTTTAGGAAC 350 

ATAACAGAAG 400 

TAAACATCCA 450 

TAATTCCAAA 500 

AGTTTAATCA 550 
558 



40 



2) INFORMATION FOR SEQ ID NO: 1372 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372 
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GCTTTCGTTG CCTTTGCCGA GGTC 



2) INFORMATION FOR SEQ ID NO: 1373 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373 
CACCCCTGTT GCTTCGCCCA CTC 



2) INFORMATION FOR SEQ ID NO: 1374 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374 
AGATATTGGC TTCGCCGCAC CACA 



2) INFORMATION FOR SEQ ID NO: 1375 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1375 
CCCTGTTGCT TCGCCCACTC CTG 
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2) INFORMATION FOR SEQ ID NO: 1376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 bases 

(B) TYPE: Nucleic acid 

(C) • STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M94066 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376 

ATGATCGTCA TCTGCGACCA CGACAACCTC GACGCCTGGC TGGCGCTGCG 50 

CACCGCGCTG TGGCCCTCCG GCTCGCCTGA AGATCACCGC GCGGAAATGC 100 

GCGAGATATT GGCTTCGCCG CACCACACCG CGTTTATGGC GCGGGGGCTG 150 

GACGGCGCTT TCGTTGCCTT TGCCGAGGTC GCGCTGCGCT ACGATTACGT 200 

CAACGGCTGC GAATCGTCGC CGGTGGCGTT TTTGGAAGGA ATTTATACCG 250 

CCGAACGCGC CCGCCGCCAG GGCTGGGCCG CGCGCCTGAT CGCGCAGGTG 300 

CAGGAGTGGG CGAAGCAACA GGGGTGCAGC GAGCTGGCGT CGGATACCGA 350 

TATCGCCAAT CTGGACTCCC AGCGCCTGCA TGCGGCGCTG GGCTTTGCCG 400 

AAACGGAGCG AGTAGTGTTT TACCGCAAAA CGCTGGGCTG A 441 



2) INFORMATION FOR SEQ ID NO: 1377 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1377 
GCCGTGGGTC GATGTTTGAT GTTA 



2) INFORMATION FOR SEQ ID NO: 1378 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1378 
GCTCGATGAC GCCAACTACC TCTG 



2) INFORMATION FOR SEQ ID NO: 1379 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1379 
AGCAGCAACG ATGTTACGCA GCAG 



2) INFORMATION FOR SEQ ID NO: 1380 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1380 
CGCTCGATGA CGCCAACTAC CTCT 



2) INFORMATION FOR SEQ ID NO: 1381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X02340 
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GTGGTAACGG CGCAGTGGCG GTTTTCATGG CTTCTTGTTA TGACATGTTT 50 

TTTTGGGGTA CAGTCTATGC CTCGGGCATC CAAGCAGCAA GCGCGTTACG 100 

5 CCGTGGGTCG ATGTTTGATG TTATGGAGCA GCAACGATGT TACGCAGCAG 150 

GGCAGTCGCC CTAAAACAAA GTTAAACATC ATGAGGGAAG CGGTGATCGC 200 

CGAAGTATCG ACTCAACTAT CAGAGGTAGT TGGCGTCATC GAGCGCCATC 250 

TCGAACCGAC GTTGCTGGCC GTACATTTGT ACGGCTCCGC AGTGGATGGC 300 

GGCCTGAAGC CACACAGTGA TATTGATTTG CTGGTTACGG TGACCGTAAG 350 

10 GCTTGATGAA ACAACGCGGC GAGCTTTGAT CAACGACCTT TTGGAAACTT 400 

CGGCTTCCCC TGGAGAGAGC GAGATTCTCC GCGCTGTAGA AGTCACCATT 450 

GTTGTGCACG ACGACATCAT TCCGTGGCGT TATCCAGCTA AGCGCGAACT 500 

GCAATTTGGA GAATGGCAGC GCAATGACAT TCTTGCAGGT ATCTTCGAGC 550 

CAGCCACGAT CGACATTGAT CTGGCTATCT TGCTGACAAA AGCAAGAGAA 600 

15 CATAGCGTTG CCTTGGTAGG TCCAGCGGCG GAGGAACTCT TTGATCCGGT 650 

TCCTGAACAG GATCTATTTG AGGCGCTAAA TGAAACCTTA ACGCTATGGA 700 

ACTCGCCGCC CGACTGGGCT GGCGATGAGC GAAATGTAGT GCTTACGTTG 750 

TCCCGCATTT GGTACAGCGC AGTAACCGGC AAAATCGCGC CGAAGGATGT 800 

CGCTGCCGAC TGGGCAATGG AGCGCCTGCC GGCCCAGTAT CAGCCCGTCA 850 

20 TACTTGAAGC TAGACAGGCT TATCTTGGAC AAGAAGAAGA TCGCTTGGCC 900 

TCGCGCGCAG ATCAGTTGGA AGAATTTGTC CACTACGTGA AAGGCGAGAT 950 

CACCAAGGTA GTCGGCAAAT AA 972 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1382 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1382 
TAGATATGAT AGGCGGTAAA AAGC 24 

2) INFORMATION FOR SEQ ID NO: 1383 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1383 



737 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CCCAAATTCG AGTAAGAGGT ATT 



2) INFORMATION FOR SEQ ID NO: 1384 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1384 
GATATGATAG GCGGTAAAAA GC 22 



2) INFORMATION FOR SEQ ID NO: 1385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1385 
TCCCAAATTC GAGTAAGAGG TA 22 



2) INFORMATION FOR SEQ ID NO: 1386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1386 
ATGAAAGAAA GATATGGAAC AGTATATAAA GGCTCTCAGA GGCTCATAGA 50 
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CGAGGAAAGT GGAGAAGTAA TAGAGGTAGA 
CGTCTGGTAA CTTTGTAAAA GCGTATATCG 
GATATGATAG GCGGTAAAAA GCTCAAGATT 
TGTACATCTA AGTAATAACA CAATGATAGC 
5 AAGGAACAAA TACAAGCACG AAAACCGTAA 
GAAGAAGGAA ATATCATTAA AAGAAGAACT 
AGAGCTACTC ATGAGAGGCG ATGACCAAAA 
AATTTGGGAA CTTTGAGCAA GAGGACGACC 
TCAGAATATT ATTCTTTCAA GGAGTAG 



PCT/CA00/01150 

TAAGCTATAC CGTAAACAAA 100 

TCCAATTAAT AAGTATGTTA 150 

GTTAATTATA TATTAGATAA 200 

AACTGTTAGA GAAATAGCAG 250 

ATACAACGCT TAAAATCTTA 300 

GGAGCATTAA TGCTAAACCC 350 

ACAAAAATAC CTCTTACTCG 400 

AAAAGCAAGA AAATGCTTTA 450 

477 



2) INFORMATION FOR SEQ ID NO: 1387 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1387 
25 TTATGCCTCT TCCGACCATC AAGC 24 



2) INFORMATION FOR SEQ ID NO: 1388 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1388 

40 

TACGCTCGTC ATCAAAATCA CTCG 24 



45 2) INFORMATION FOR SEQ ID NO: 1389 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1389 
GAATAACGGT TTGGTTGATG CGAG 



2) INFORMATION FOR SEQ ID NO: 1390 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single t 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1390 
ATGGCAAGAT CCTGGTATCG GTCT 



2) INFORMATION FOR SEQ ID NO: 1391 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J01839 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1391 



ATGAGCCATA TTCAACGGGA 
CAACATGGAT GCTGATTTAT 
GGCAATCAGG TGCGACAATC 
GAGTTGTTTC TGAAACATGG 
TGAGATGGTC AGACTAAACT 
TCAAGCATTT TATCCGTACT 
ATCCCCGGGA AAACAGCATT 
TGAAAATATT GTTGATGCGC 
TTCCTGTTTG TAATTGTCCT 
CAGGCGCAAT CACGAATGAA 
TGACGAGCGT AATGGCTGGC 
AGCTTTTGCC ATTCTCACCG 
CTTGATAACC TTATTTTTGA 
TGGACGAGTC GGAATCGCAG 



AACGTCTTGC TCGAGGCCGC 
ATGGGTATAA ATGGGCTCGC 
TATCGATTGT ATGGGAAGCC 
CAAAGGTAGC GTTGCCAATG 
GGCTGACGGA ATTTATGCCT 
CCTGATGATG CATGGTTACT 
CCAGGTATTA GAAGAATATC 
TGGCAGTGTT CCTGCGCCGG 
TTTAACAGCG ATCGCGTATT 
TAACGGTTTG GTTGATGCGA 
CTGTTGAACA AGTCTGGAAA 
GATTCAGTCG TCACTCATGG 
CGAGGGGAAA TTAATAGGTT 
ACCGATACCA GGATCTTGCC 

740 



GATTAAATTC 50 

GATAATGTCG 100 

CGATGCGCCA 150 

ATGTTACAGA 200 

CTTCCGACCA 250 

CACCACTGCG 300 

CTGATTCAGG 350 

TTGCATTCGA 400 

TCGTCTCGCT 450 

GTGATTTTGA 500 

GAAATGCATA 550 

TGATTTCTCA 600 

GTATTGATGT 650 

ATCCTATGGA 700 
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ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 750 
TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT 800 
CGATGAGTTT TTCTAA 816 



2) INFORMATION FOR SEQ ID NO: 1392 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1392 
TGGGTGGAGA GGCTATTCGG CTAT 24 

20 



2) INFORMATION FOR SEQ ID NO: 1393 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1393 
35 CAGTCCCTTC CCGCTTCAGT GAC 23 



40 



2) INFORMATION FOR SEQ ID NO: 1394 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1394 

50 

GACGTTGTCA CTGAAGCGGG AAGG 24 
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2) INFORMATION FOR SEQ ID NO: 1395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1395 
CTTGGTGGTC GAATGGGCAG GTAG 



2) INFORMATION FOR SEQ ID NO: 1396 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00618 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1396 

ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 50 

GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG 100 

CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 150 

ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 200 

ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG 250 

TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG 300 

GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 350 

TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG 400 

ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 4 50 

GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC 500 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC 550 

TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT 600 

GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 650 

CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG 700 

GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 750 

TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGA 795 



2) INFORMATION FOR SEQ ID NO: 1397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1397 
GTGGGAGAAA ATGAAAACCT AT 22 



15 2) INFORMATION FOR SEQ ID NO: 1398 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1398 

ATGGAGTGAA AGAGCCTGAT 20 



2) INFORMATION FOR SEQ ID NO: 1399 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1399 
ACCTATGATG TGGAACGGGA AAAG 24 

2) INFORMATION FOR SEQ ID NO: 1400 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1400 
CGATGGAGTG AAAGAGCCTG ATG 



2) INFORMATION FOR SEQ ID NO: 1401 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE : • Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
(C) ACCESSION NUMBER: V01547 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1401 

ATGGCTAAAA TGAGAATATC ACCGGAATTG AAAAAACTGA TCGAAAAATA 50 

CCGCTGCGTA AAAGATACGG AAGGAATGTC TCCTGCTAAG GTATATAAGC 100 

TGGTGGGAGA AAATGAAAAC CTATATTTAA AAATGACGGA CAGCCGGTAT 150 

AAAGGGACCA CCTATGATGT GGAACGGGAA AAGGACATGA TGCTATGGCT 200 

GGAAGGAAAG CTGCCTGTTC CAAAGGTCCT GCACTTTGAA CGGCATGATG 250 

GCTGGAGCAA TCTGCTCATG AGTGAGGCCG ATGGCGTCCT TTGCTCGGAA 300 

GAGTATGAAG ATGAACAAAG CCCTGAAAAG ATTATCGAGC TGTATGCGGA 350 

GTGCATCAGG CTCTTTCACT CCATCGACAT ATCGGATTGT CCCTATACGA 4 00 

ATAGCTTAGA CAGCCGCTTA GCCGAATTGG ATTACTTACT GAATAACGAT 4 50 

CTGGCCGATG TGGATTGCGA AAACTGGGAA GAAGACACTC CATTTAAAGA 500 

TCCGCGCGAG CTGTATGATT TTTTAAAGAC GGAAAAGCCC GAAGAGGAAC 550 

TTGTCTTTTC CCACGGCGAC CTGGGAGACA GCAACATCTT TGTGAAAGAT 600 

GGCAAAGTAA GTGGCTTTAT TGATCTTGGG AGAAGCGGCA GGGCGGACAA 650 

GTGGTATGAC ATTGCCTTCT GCGTCCGGTC GATCAGGGAG GATATCGGGG 700 

AAGAACAGTA TGTCGAGCTA TTTTTTGACT TACTGGGGAT CAAGCCTGAT 7 50 

TGGGAGAAAA TAAAATATTA TATTTTACTG GATGAATTGT TTTAG 7 95 



2) INFORMATION FOR SEQ ID NO: 1402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1402 
TATTCAACAA TTTATCGGAA ACAG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1403 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE;. DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1403 
20 TCAGAGAGCC AACTCAACAT TTT 23 



2) INFORMATION FOR SEQ ID NO: 1404 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1404 

35 

AAACAGCGTT TTAGAGCCAA ATAA 24 

40 2) INFORMATION FOR SEQ ID NO: 1405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1405 

TTCTCAGAGA GCCAACTCAA CATT 24 
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2) INFORMATION FOR SEQ ID NO: 1406 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter bavmannii 

(B) STRAIN: BM2580 

(C) ACCESSION NUMBER: X07753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1406 

ATGGAATTGC CCAATATTAT TCAACAATTT ATCGGAAACA GCGTTTTAGA 50 

GCCAAATAAA ATTGGTCAGT CGCCATCGGA TGTTTATTCT TTTAATCGAA 100 

ATAATGAAAC TTTTTTTCTT AAGCGATCTA GCACTTTATA TACAGAGACC 150 

ACATACAGTG TCTCTCGTGA AGCGAAAATG TTGAGTTGGC TCTCTGAGAA 200 

ATTAAAGGTG CCTGAACTCA TCATGACTTT TCAGGATGAG CAGTTTGAAT 250 

TCATGATCAC TAAAGCGATC AATGCAAAAC CAATTTCAGC GCTTTTTTTA 300 

ACAGACCAAG AATTGCTTGC TATCTATAAG GAGGCACTCA ATCTGTTAAA 350 

TTCAATTGCT ATTATTGATT GTCCATTTAT TTCAAACATT GATCATCGGT 4 00 

TAAAAGAGTC AAAATTTTTT ATTGATAACC AACTCCTTGA CGATATAGAT 4 50 

CAAGATGATT TTGACACTGA ATTATGGGGA GACCATAAAA CTTACCTAAG 500 

TCTATGGAAT GAGTTAACCG AGACTCGTGT TGAAGAAAGA TTGGTTTTTT 550 

CTCATGGCGA TATCACGGAT AGTAATATTT TTATAGATAA ATTCAATGAA 600 

ATTTATTTTT TAGATCTTGG TCGTGCTGGG TTAGCAGATG AATTTGTAGA 650 

TATATCCTTT GTTGAACGTT GCCTAAGAGA GGATGCATCG GAGGAAACTG 700 

CGAAAATATT TTTAAAGCAT TTAAAAAATG ATAGACCTGA CAAAAGGAAT 7 50 

TATTTTTTAA AACTTGATGA ATTGAATTGA 7 80 



2) INFORMATION FOR SEQ ID NO: 1407 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1407 
CCCTGTAATA GAAAAGCAAG TAGG 



2) INFORMATION FOR SEQ ID NO: 1408 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1408 
TTGTCGTATC CCTCAAATCA CC 



2) INFORMATION FOR SEQ ID NO: 1409. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1409 
TGGGATTACA ATGGCAATCA GCG 



2) INFORMATION FOR SEQ ID NO: 1410 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1410 
GGGGAATAGG TCACAAGATC TGCTT 



2) INFORMATION FOR SEQ ID NO: 1411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: S46063 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1411 

ATGCTTTTAT ATAAAATGTG TGACAATCAA AATTATGGGG TTACTTACAT 50 

GAAGTTTTTA TTGGCATTTT CGCTTTTAAT ACCATCCGTG GTTTTTGCAA 100 

GTAGTTCAAA GTTTCAGCAA GTTGAACAAG ACGTTAAGGC AATTGAAGTT 150 

TCTCTTTCTG CTCGTATAGG TGTTTCCGTT' CTTGATACTC AAAATGGAGA 200 

ATATTGGGAT TACAATGGCA ATCAGCGCTT CCCGTTAACA AGTACTTTTA 250 

AAACAATAGC TTGCGCTAAA TTACTATATG ATGCTGAGCA AGGAAAAGTT 300 

AATCCCAATA GTACAGTCGA GATTAAGAAA GCAGATCTTG TGACCTATTC 350 

CCCTGTAATA GAAAAGCAAG TAGGGCAGGC AATCACACTC GATGATGCGT 400 

GCTTCGCAAC TATGACTACA AGTGATAATA CTGCGGCAAA TATCATCCTA 450 

AGTGCTGTAG GTGGCCCCAA AGGCGTTACT GATTTTTTAA GACAAATTGG 500 

GGACAAAGAG ACTCGTCTAG ACCGTATTGA GCCTGATTTA AATGAAGGTA 550 

AGCTCGGTGA TTTGAGGGAT ACGACAACTC CTAAGGCAAT AGCCAGTACT 600 

TTGAATAAAC TTTTATTTGG TTCCGCGCTA TCTGAAATGA ACCAGAAAAA 650 

ATTAGAGTCT TGGATGGTGA ACAATCAAGT CACTGGTAAT TTACTACGTT 700 

CAGTATTGCC GGCGGGATGG AACATTGCGG ATCGCTCAGG TGCTGGCGGA 750 

TTTGGTGCTC GGAGTATTAC AGCAGTTGTG TGGAGTGAGC ATCAAGCCCC 800 

AATTATTGTG AGCATCTATC TAGCTCAAAC ACAGGCTTCA ATGGCAGAGC 850 

GAAATGATGC GATTGTTAAA ATTGGTCATT CAATTTTTGA CGTTTATACA 900 

TCACAGTCGC GC 912 



2) INFORMATION FOR SEQ ID NO: 1412 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1412 
GAGAAAACGC TCCAGCAGGG C 



2) INFORMATION FOR SEQ ID NO: 1413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1413 
CATGAGGCTT TCACTGCGGG G 21 



2) INFORMATION FOR SEQ ID NO: 1414 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1414 
TATCGTTAAT CGCACCATCA C 21 

25 

2) INFORMATION FOR SEQ ID NO: 1415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1415 
4 0 ATGCAGTAAT GCGGCTTTAT C 21 



2) INFORMATION FOR SEQ ID NO: 1416 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: HEL-1 

(C) ACCESSION NUMBER: X91840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1416 

ATGATGAAAA AATCGTTATG CTGCGCTCTG CTGCTGACAG CCTCTTTCTC 50 

CACATTTGCT GCCGCAAAAA CAGAACAACA GATTGCCGAT ATCGTTAATC 100 

GCACCATCAC CCCGTTGATG CAGGAGCAGG CTATTCCGGG TATGGCCGTT 150 

GCCGTTATCT ACCAGGGAAA ACCCTATTAT TTCACCTGGG GTAAAGCCGA 200 

TATCGCCAAT AACCACCCAG TCACGCAGCA AACGCTGTTT GAGCTAGGAT 250 

CGGTTAGTAA GACGTTTAAC GGCGTGTTGG^ GCGGCGATGC TATCGCCCGC 300 

GGCGAAATTA AGCTCAGCGA TCCGGTCACG' AAATACTGGC CAGAACTGAC 350 

AGGCAAACAG TGGCAGGGTA TCCGCCTGCT GCACTTAGCC ACCTATACGG 400 

CAGGCGGCCT ACCGCTGCAG ATCCCCGATG ACGTTAGGGA TAAAGCCGCA 4 50 

TTACTGCATT TTTATCAAAA CTGGCAGCCG CAATGGACTC CGGGCGCTAA 500 

GCGACTTTAC GCTAACTCCA GCATTGGTCT GTTTGGCGCG CTGGCGGTGA 550 

AACCCTCAGG AATGAGTTAC GAAGAGGCAA TGACCAGACG CGTCCTGCAA 600 

CCATTAAAAC TGGCGCATAC CTGGATTACG GTTCCGCAGA ACGAACAAAA 650 

AGATTATGCC TGGGGCTATC GCGAAGGGAA GCCCGTACAC GTTTCTCCGG 700 

GACAACTTGA CGCCGAAGCC TATGGCGTGA AATCCAGCGT TATTGATATG 750 

GCCCGCTGGG TTCAGGCCAA CATGGATGCC AGCCACGTTC AGGAGAAAAC 800 

GCTCCAGCAG GGCATTGCGC TTGCGCAGTC TCGCTACTGG CGTATTGGCG 850 

ATATGTACCA GGGATTAGGC TGGGAGATGC TGAACTGGCC GCTGAAAGCT 900 

GATTCGATCA TCAACGGCAG CGACAGCAAA GTGGCATTGG CAGCGCTTCC 950 

CGCCGTTGAG GTAAACCCGC CCGCCCCCGC AGTGAAAGCC TCATGGGTGC 1000 

ATAAAACGGG CTCCACTGGT GGATTTGGCA GCTACGTAGC CTTCGTTCCA 1050 

GAAAAAAACC TTGGCATCGT GATGCTGGCA AACAAAAGCT ATCCTAACCC 1100 

TGTCCGTGTC GAGGCGGCCT GGCGCATTCT TGAAAAGCTG CAATAA 114 6 



2) INFORMATION FOR SEQ ID NO: 1417 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1417 
TGGTTAACTA YAATCCSATT GCGGA 



2) INFORMATION FOR SEQ ID NO: 1418 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1418 
ATGCTTTACC CAGCGTCAGA TT 22 



2) INFORMATION FOR SEQ ID NO:' 1419 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1419 
25 CGATGAATAA GCTGATTTCT CACG 24 



2) INFORMATION FOR SEQ ID NO: 1420 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1420 

40 

TGCTTTACCC AGCGTCAGAT TACG 24 



45 2) INFORMATION FOR SEQ ID NO: 1421 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1421 
AATTAGAGCG GCAGTCGGGA GGAA 24 

5 

2) INFORMATION FOR SEQ ID NO: 1422 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single t 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1422 
20 GAAATCAGCT TATTCATCGC CACG 24 



2) INFORMATION FOR SEQ ID NO: 1423 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: GRI-1 

(C) ACCESSION NUMBER: X92506 



(xi) SEQUENCE DESCRIPTION: SEQ 

ATGGTTAAAA AATCACTGCG TCAGTTCACG 
CACGCTGTTG TTAGGAAGTG TGCCGCTGTA 
AGCAAAAACT TGCCGAATTA GAGCGGCAGT 
GCATTGATTA ACACAGCAGA TAATTCGCAA 
GCGCTTTGCG ATGTGCAGCA CCAGTAAAGT 
TGAAGAAAAG TGAAAGCGAA CCGAATCTGT 
AAAAAATCTG ACTTGGTTAA CTATAATCCG 
TGGGACGATG TCACTGGCTG AGCTTAGCGC 
ATAACGTGGC GATGAATAAG CTGATTTCTC 
GTCACCGCGT TCGCCCGACA GCTGGGAGAC 
TACCGAGCCG ACGTTAAACA CCGCCATTCC 
CTTCACCTCG GGCAATGGCG CAAACTCTGC 
GCATTGGGTG ACAGCCAACG GGCGCAGCTG 



ID NO: 1423 



CTGATGGCGA 


CGGCAACCGT 


50 


TGCGCAAACG 


GCGGACGTAC 


100 


CGGGAGGAAG 


ACTGGGTGTG 


150 


ATACTTTATC 


GTGCTGATGA 


200 


GATGGCCGTG 


GCCGCGGTGC 


250 


TAAATCAGCG 


AGTTGAGATC 


300 


ATTGCGGAAA 


AGCACGTCGA 


350 


GGCCGCGCTA 


CAGTACAGCG 


400 


ACGTTGGCGG 


CCCGGCTAGC 


450 


GAAACGTTCC 


GTCTCGACCG 


500 


GGGCGATCCG 


CGTGATACCA 


550 


GTAATCTGAC 


GCTGGGTAAA 


600 


GTGACATGGA 


TGAAAGGCAA 


650 
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TACCACCGGT GCAGCGAGCA TTCAGGCTGG 

TGGGGGATAA AACCGGCAGC GGTGACTATG 

GTGATCTGGC CAAAAGATCG TGCGCCGCTG 

CCAGCCTCAA CCTAAGGCAG AAAGCCGTCG 

5 CTAAAATCGT CACCAACGGT TTGTAA 



PCT/CA00/01150 

ACTGCCTGCT TCCTGGGTTG 700 
GCACCACCAA CGATATCGCG 750 
ATTCTGGTCA CTTACTTCAC 800 
CGATGTATTA GCGTCGGCGG 850 

876 



2) INFORMATION FOR SEQ ID NO: 1424 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1424 

20 

GTTAACGGTG ATGGCGACGC TAC 23 



25 2) INFORMATION FOR SEQ ID NO: 1425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1425 

GAATTATCGG CGGTGTTAAT CAGC 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1426 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1426 

CACGCTCAAT ACCGCCATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1427 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1427 
TTATCGCCCA CTACCCATGA TTTC 



2) INFORMATION FOR SEQ ID NO: 1428 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: CAS-5 

(C) ACCESSION NUMBER: X92507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1428 

ATGATGACTC AGAGCATTCG CCGCTCAATG TTAACGGTGA TGGCGACGCT 50 

ACCCCTGCTA TTTAGCAGCG CAACGCTGCA TGCGCAGGCG AACAGCGTGC 100 

AACAGCAGCT GGAAGCCCTG GAGAAAAGTT CGGGAGGTCG GCTTGGCGTT 150 

GCGCTGATTA ACACCGCCGA TAATTCGCAG ATTCTCTACC GTGCCGATGA 200 

ACGTTTTGCG ATGTGCAGTA CCAGTAAGGT GATGGCGGCC GCGGCGGTGC 250 

TTAAACAGAG CGAGAGCGAT AAGCACCTGC TAAATCAGCG CGTTGAAATC 300 

AAGAAGAGCG ACCTGGTTAA CTACAATCCC ATTGCGGAGA AACACGTTAA 350 

CGGCACGATG ACGCTGGCTG AGCTTGGCGC AGCGGCGCTG CAGTATAGCG 400 

ACAATACTGC CATGAATAAG CTGATTGCCC ATCTGGGTGG TCCCGATAAA 450 

GTGACGGCGT TTGCTCGCTC GTTGGGTGAT GAGACCTTCC GTCTGGACAG 500 

AACCGAGCCC ACGCTCAATA CCGCCATTCC AGGCGACCCG CGTGATACCA 550 

CCACGCCGCT CGCGATGGCG CAGACCCTGA AAAATCTGAC GCTGGGTAAA 600 

GCGCTGGCGG AAACTCAGCG GGCACAGTTG GTGACGTGGC TTAAGGGCAA 650 

TACTACCGGT AGCGCGAGCA TTCGGGCGGG TCTGCCGAAA TCATGGGTAG 700 

TGGGCGATAA AACCGGCAGC GGAGATTATG GCACCACCAA CGATATCGCG 750 

GTTATCTGGC CGGAAAACCA CGCACCGCTG GTTCTGGTGA CCTACTTTAC 800 

CCAACCGGAG CAGAAGGCGG AAAGCCGTCG GGATATTCTG GCTGCGGCGG 850 

CGAAAATCGT AACCCACGGT TTCTGA 876 
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2) INFORMATION FOR SEQ ID NO: 1429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1429 
TTTACGGCTA AAGATACTGA AAAGT 25 



2) INFORMATION FOR SEQ ID NO: 1430 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1430 
GTTTAATAAA ACAACCACCG AATAAT 26 

30. 

2) INFORMATION FOR SEQ ID NO: 1431 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1431 
45 TAATTGACAC TCCATTTACG GCTAA 25 



2) INFORMATION FOR SEQ ID NO: 1432 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1432 
ACCGAATAAT ATTTTCCTTT CAGGCA 



2) INFORMATION FOR SEQ ID NO: 1433 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 741 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: AJ223604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1433 

ATGAGCAAGT TATCTGTATT CTTTATATTT TTGTTTTGCA GCATTGCTAC 50 

CGCAGCAGAG TCTTTGCCAG ATTTAAAAAT TGAAAAGCTT GATGAAGGCG 100 

TTTATGTTCA TACTTCGTTT GAAGAAGTTA ACGGGTGGGG CGTTGTTCCT 150 

AAACATGGTT TGGTGGTTCT TGTAAATGCT GAGGCTTACC TAATTGACAC 200 

TCCATTTACG GCTAAAGATA CTGAAAAGTT AGTCACTTGG TTTGTGGAGC 250 

GTGGCTATAA AATAAAAGGC AGCATTTCCT CTCATTTTCA TAGCGACAGC 300 

ACGGGCGGAA TAGAGTGGCT TAATTCTCGA TCTATCCCCA CGTATGCATC 350 

TGAATTAACA AATGAACTGC TTAAAAAAGA CGGTAAGGTT CAAGCCACAA 4 00 

ATTCATTTAG CGGAGTTAAC TATTGGCTAG TTAAAAATAA AATTGAAGTT 4 50 

TTTTATCCAG GCCCGGGACA CACTCCAGAT AACGTAGTGG TTTGGTTGCC 500 

TGAAAGGAAA ATATTATTCG GTGGTTGTTT TATTAAACCG TACGGTTTAG 550 

GCAATTTGGG TGACGCAAAT ATAGAAGCTT GGCCAAAGTC CGCCAAATTA 600 

TTAAAGTCCA AATATGGTAA GGCAAAACTG GTTGTTCCAA GTCACAGTGA 650 

AGTTGGAGAC GCATCACTCT TGAAACTTAC ATTAGAGCAG GCGGTTAAAG 700 

GGTTAAACGA AAGTAAAAAA CCATCAAAAC CAAGCAACTA A 741 



2) INFORMATION FOR SEQ ID NO: 1434 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1434 
5 CACAATCAAG ACCAAGATTT GCGAT 25 



10 



2) INFORMATION FOR SEQ ID NO: 1435 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1435 

20 

GAAAGGGCAG CTCGTTACGA TAGAG 25 



25 2) INFORMATION FOR SEQ ID NO: 14 36 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1436 

CAGCATCAAC ATTTAAGATC CCCA 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1437 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1437 

CTCCACTTGA TTAACTGCGG AAATTC 26 
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2) INFORMATION FOR SEQ ID NO: 1438 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X06046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1438 

ATGGCAATCC GAATCTTCGC GATACTTTTC TCCATTTTTT CTCTTGCCAC 50 

TTTCGCGCAT GCGCAAGAAG GCACGCTAGA ACGTTCTGAC TGGAGGAAGT 100 

TTTTCAGCGA ATTTCAAGCC AAAGGCACGA TAGTTGTGGC AGACGAACGC 150 

CAAGCGGATC GTGCCATGTT GGTTTTTGAT CCTGTGCGAT CGAAGAAACG 200 

CTACTCGCCT GCATCGACAT TCAAGATACC TCATACACTT TTTGCACTTG 250 

ATGCAGGCGC TGTTCGTGAT GAGTTCCAGA TTTTTCGATG GGACGGCGTT 300 

AACAGGGGCT TTGCAGGCCA CAATCAAGAC CAAGATTTGC GATCAGCAAT 350 

GCGGAATTCT ACTGTTTGGG TGTATGAGCT ATTTGCAAAG GAAATTGGTG 400 

ATGACAAAGC TCGGCGCTAT TTGAAGAAAA TCGACTATGG CAACGCCGAT 4 50 

CCTTCGACAA GTAATGGCGA TTACTGGATA GAAGGCAGCC TTGCAATCTC 500 

GGCGCAGGAG CAAATTGCAT TTCTCAGGAA GCTCTATCGT AACGAGCTGC 550 

CCTTTCGGGT AGAACATCAG CGCTTGGTCA AGGATCTCAT GATTGTGGAA 600 

GCCGGTCGCA ACTGGATACT GCGTGCAAAG ACGGGCTGGG AAGGCCGTAT 650 

GGGTTGGTGG GTAGGATGGG TTGAGTGGCC GACTGGCTCC GTATTCTTCG 700 

CACTGAATAT TGATACGCCA AACAGAATGG ATGATCTTTT CAAGAGGGAG 750 

GCAATCGTGC GGGCAATCCT TCGCTCTATT GAAGCGTTAC CGCCCAACCC 800 

GGCAGTCAAC TCGGACGCTG CGCGATAA 828 



2) INFORMATION FOR SEQ ID NO: 1439 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: J03427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1439 
ATGAAAACAT TTGCCGCATA TGTAATTATC GCGTGTCTTT CGAGTACGGC 50 

758 
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ATTAGCTGGT TCAATTACAG AAAATACGTC TTGGAACAAA GAGTTCTCTG 100 

CCGAAGCCGT CAATGGTGTC TTCGTGCTTT GTAAAAGTAG CAGTAAATCC 150 

TGCGCTACCA ATGACTTAGC TCGTGCATCA AAGGAATATC TTCCAGCATC 200 

AACATTTAAG ATCCCCAACG CAATTATCGG CCTAGAAACT GGTGTCATAA 250 

5 AGAATGAGCA TCAGGTTTTC AAATGGGACG GAAAGCCAAG AGCCATGAAG 300 

CAATGGGAAA GAGACTTGAC CTTAAGAGGG GCAATACAAG TTTCAGCTGT 350 

TCCCGTATTT CAACAAATCG CCAGAGAAGT TGGCGAAGTA AGAATGCAGA 4 00 

AATACCTTAA AAAATTTTCC TATGGCAACC AGAATATCAG TGGTGGCATT 4 50 

GACAAATTCT GGTTGGAAGG CCAGCTTAGA ATTTCCGCAG TTAATCAAGT 500 

10 GGAGTTTCTA GAGTCTCTAT ATTTAAATAA ATTGTCAGCA TCTAAAGAAA 550 

ACCAGCTAAT AGTAAAAGAG GCTTTGGTAA CGGAGGCGGC ACCTGAATAT • 600 

CTAGTGCATT CAAAAACTGG TTTTTCTGGT GTGGGAACTG AGTCAAATCC 650 

TGGTGTCGCA TGGTGGGTTG GGTGGGTTGA GAAGGAGACA GAGGTTTACT 700 

TTTTCGCCTT TAACATGGAT ATAGACAACG AAAGTAAGTT GCCGCTAAGA 7 50 

15 AAATCCATTC CCACCAAAAT CATGGAAAGT GAGGGCATCA TTGGTGGCTA 800 

A . 801 



20 2) INFORMATION FOR SEQ ID NO: 1440 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1440 

AGACCGTTAT CGTAAACAGG GCTAAG 26 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1441 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1441 
TTTTTTGCTC AAACTTTTTC AGGATC 26 

2) INFORMATION FOR SEQ ID NO: 1442 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: RNL-1 
. (C) ACCESSION NUM3ER: Z21957 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1442 



ATGAATGTCA TTATAAAAGC TGTAGTTACT GCCTCGACGC TACTGATGGT 50 

ATCTTTTAGT TCATTCGAAA CCTCAGCGCA ATCCCCACTG TTAAAAGAGC 100 

AAATTGAATC CATAGTCATT GGAAAAAAAG CCACTGTAGG CGTTGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTACTG ATTAATCCTT TTGAAAAATT 200 

20 CCCAATGCAA AGTGTATTTA AATTGCATTT AGCTATGTTG GTACTGCATC 250 

AGGTTGATCA GGGAAAGTTG GATTTAAATC AGACCGTTAT CGTAAACAGG 300 

GCTAAGGTTT TACAGAATAC CTGGGCTCCG ATAATGAAAG CGTATCAGGG 350 

AGACGAGTTT AGTGTTCCAG TGCAGCAACT GCTGCAATAC TCGGTCTCGC 400 

ACAGCGATAA CGTGGCCTGT GATTTGTTAT TTGAACTGGT TGGTGGACCA 450 

25 GCTGCTTTGC ATGACTATAT CCAGTCTATG GGTATAAAGG AGACCGCTGT 500 

GGTCGCAAAT GAAGCGCAGA TGCACGCCGA TGATCAGGTG CAGTATCAAA 550 

ACTGGACCTC GATGAAAGGT GCTGCAGAGA TCCTGAAAAA GTTTGAGCAA 600 

AAAACACAGC TGTCTGAAAC CTCGCAGGCT TTGTTATGGA AGTGGATGGT 650 

CGAAACCACC ACAGGACCAG AGCGGTTAAA AGGTTTGTTA CCAGCTGGTA 700 

30 CTGTGGTCGC ACATAAAACT GGTACTTCGG GTATCAAAGC CGGAAAAACT 750 

GCGGCCACTA ATGATTTAGG TATCATTCTG TTGCCTGATG GACGGCCCTT 800 

GCTGGTTGCT GTTTTTGTGA AAGACTCAGC CGAGTCAAGC CGAACCAATG 850 

AAGCTATCAT TGCGCAGGTT GCTCAGACTG CGTATCAATT TGAATTGAAA 900 

AAGCTTTCTG CCCTAAGCCC AAATTAA 927 

35 



2) INFORMATION FOR SEQ ID NO: 1443 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1443 
50 CTTCTGCTCT GCTGATGCTT GGC 23 
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2) INFORMATION FOR SEQ ID NO: 1444 



PCT/CAOO/01150 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1444 
GGCGACCAGG TATTTTGTAA TACTGC 26 

2) INFORMATION FOR SEQ ID NO: 1445 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: JMC 

30 (C) ACCESSION NUMBER: X93314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1445 

ATGAATGTCA TCACAAAATG TGTTTTCACC GCTTCTGCTC TGCTGATGCT 50 

35 TGGCTTAAGT TCATTTGTAG TATCAGCCCA ATCCCCTTTG TTAAAAGAGC 100 

AGATTGAAAC CATAGTGACG GGTAAAAAGG CCACTGTAGG TGTAGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTGTTG CTGAATCCAT TTGAAAAGTT 200 

TCCGATGCAA AGTGTGTTTA AACTGCATTT AGCTATGTTA GTTCTGCATC 250 

AGGTCGATCA GGGGAAACTG GATTTAAATC AGTCTGTTAC TGTTAATCGT 300 

40 GCTGCAGTAT TACAAAATAC CTGGTCGCCA ATGATGAAAG ATCATCAGGG 350 

CGATGAATTT ACTGTTGCAG TACAGCAGTT ACTGCAGTAT TCGGTGTCAC 400 

ACAGCGACAA TGTGGCCTGC GATTTGTTAT TTGAACTGGT GGGCGGGCCG 450 

CAAGCTTTGC ATGCTTATAT CCAGTCTTTA GGCGTTAAAG AAGCTGCCGT 500 

GGTAGCAAAT GAAGCGCAAA TGCATGCGGA TGATCAGGTG CAATATCAAA 550 

45 ACTGGACGTC GATGAAAGCC GCAGCACAAG TTCTGCAAAA GTTTGAACAG 600 

AAAAAGCAGT TGTCTGAAAC CTCTCAGGCC TTGTTATGGA AATGGATGGT 650 

TGAAACCACC ACAGGACCAC AGCGGTTAAA AGGCTTGTTA CCTGCTGGTA 700 

CTATAGTGGC GCATAAAACC GGTACTTCGG GCGTCAGAGC AGGAAAAACT 750 

GCGGCGACTA ATGATGCGGG CGTCATTATG TTGCCTGATG GACGGCCTTT 800 

50 ATTGGTGGCG GTATTTGTCA AGGATTCGGC TGAATCAGAA CGAACCAATG 850 

AAGCTATTAT TGCGCAGGTT GCGCAAGCGG CTTATCAGTT TGAGCTGAAA 900 

AAACTCTCTG CAGTGAGTCC GGATTGA 927 
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2) INFORMATION FOR SEQ ID NO: 1446 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1446 
GGCCTGYGAT TTGTTATTTG AACTGGT 



2) INFORMATION FOR SEQ ID NO: 1447 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1447 
CGCTSTGGTC CTGTGGTGGT TTC 



2) INFORMATION FOR SEQ ID NO: 1448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1448 
GATCAGGTGC ARTATCAAAA CTGGAC 



2) INFORMATION FOR SEQ ID NO: 144 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS:' Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1449 
AGCWGGTAAC AAYCCTTTTA ACCGCT 26 



2) INFORMATION FOR SEQ ID NO: 1450 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1450 
ACCACTGGGA ATACACTTGT AATGGC 26 

25 

2) INFORMATION FOR SEQ ID NO: 1451 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1451 
40 ATCTACCTGG TCAATCATTG CTTCGT 26 



2) INFORMATION FOR SEQ ID NO: 1452 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: BM10393 

(C) ACCESSION NUMBER: AF045472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1452 

ATGACATTAT CAATAATTGT CGCTCACGAT AAACAAAGAG TCATTGGGTA 50 

CCAAAATCAA TTACCTTGGC ACTTACCAAA TGATTTAAAG CATATTAAAC 100 

AACTGACCAC TGGGAATACA CTTGTAATGG CACGGAAAAC TTTTAATTCT 150 

ATAGGGAAGC CATTGCCAAA TAGACGTAAC GTCGTACTCA CTAACCAAGC 200 

TTCATTTCAC CATGAAGGGG TAGATGTTAT AAACTCTCTT GATGAAATTA 250 

AAGAGTTATC TGGTCATGTT TTTATATTTG, GAGGACAAAC GTTATACGAA 300 

GCAATGATTG ACCAGGTAGA TGATATGTAT ATCACAGTAA TAGATGGAAA 350 

GTTTCAAGGA GACACATTCT TTCCACCATA CACATTCGAA AACTGGGAAG 400 

TCGAATCTTC AGTAGAAGGT CAACTAGATG AAAAAAATAC TATACCGCAT 4 50 

ACATTCTTAC ATTTAGTGCG TAGAAAAGGG AAATAG 486 



2) INFORMATION FOR SEQ ID NO: 1453 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1453 
ATCGAAGAAT GGAGTTATCG GRAATG 



2) INFORMATION FOR SEQ ID NO: 1454 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1454 
CCTAAAAYTR 'CTGGGGATTT CWGGA 



2) INFORMATION FOR SEQ ID NO: 1455 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1455 
CAGGTGGTGG GGAGATATAC AAAA 



2) INFORMATION FOR SEQ ID NO: 1456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1456 
TATGTTAGAS RCGAAGTCTT GGKTAA 



2) INFORMATION FOR SEQ ID NO: 1457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1457 
CAAAGGTGAA CAGCTCCTGT TT 



2) INFORMATION FOR SEQ ID NO: 1458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1458 
TCCGTTATTT TCTTTAGGTT GGTTAAA 

2) INFORMATION FOR SEQ ID NO: 1459 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1459 
AAGGTGAACA GCTCCTGTTT 



2) INFORMATION FOR SEQ ID NO: 1460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1460 
GATCACTACG TTCTCATTGT CA 



2) INFORMATION FOR SEQ ID NO: 1461 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AJ238350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1461 



2) INFORMATION FOR SEQ ID NO: 1462 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 62 
GCACTCCCYA ATAGGAAATA CGC 



2) INFORMATION FOR SEQ ID NO: 1463 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1463 
AGTGTTGCTC AAAAACAACT TCG 



2) INFORMATION FOR SEQ ID NO: 1464 
(i) SEQUENCE CHARACTERISTICS: 



GTGAAACTAT CACTAATGGT 
TGGCCCTGAT ATTCCATGGA 
CTATTACCTA TAACCAATGG 
ATGGGAGCAT TACCCAACCG 
TACATCTGAC AATGAGAACG 
TAACCAACCT AAAGAAAATA 
GAGATATACA AAAGCCTGAT 
AATAGACATC GAGCCGGAAG 
ATTTTAGGCC AGTTTTTACC 
TACCAAATCT GGCAAAAGGG 



AGCTATATCG AAGAATGGAG TTATCGGGAA 
GTGCCAAAGG TGAACAGCTC CTGTTTAAAG 
CTGTTGGTTG GACGCAAGAC TTTTGAATCA 
AAAGTATGCG GTCGTAACAC GTCCAAGTTT 
TAGTGATCTT TCCATCAATT AAAGATGCTT 
ACGGATCATG TCATTGTTTC AGGTGGTGGG 
CGATCAAGTA GATACACTAC ATATATCTAC 
GTGATGTTTA CTTTCCTGAA ATCCCCAGCA 
CAAGACTTCG CCTCTAACAT AAATTATAGT 
TTAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
474 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid' 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1464 
10 ACGTTYGAAT CTATGGGMGC ACT 23 



15 



2) INFORMATION FOR SEQ ID NO: 1465 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1465 

25 

GTCGATAAGT GGAGCGTAGA GGC 23 



30 2) INFORMATION FOR SEQ ID NO: 1466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1466 

AAGCATTGAC CTACAATCAG TGT 23 



45 



2) INFORMATION FOR SEQ ID NO: 1467 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1467 
5 AATACAACTA CATTGTCATC ATTTGAT 27 



2) INFORMATION FOR SEQ ID NO: 1468 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1468 

20 

CGTTACCCGC TCAGGTTGGA CATCAA 26 



25 2) INFORMATION FOR SEQ ID NO: 1469 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1469 

CATCCCCCTC TGGCTCGATG TCG 23 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1470 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia' coli 
(C) ACCESSION NUMBER: Z50804 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1470 



TTGAAAGTAT CATTGATAGC 
CGGTCCAGAC ATACCGTGGT 
CATTGACCTA CAATCAGTGT 
ATGGGCGCAC TCCCCAATAG 
GACATCAAAT GATGACAATG 
TGGACAGGCT AGCTGAATTC 
GAAATTTACC GAGAAACATT 
GATCGACATC GAGCCAGAGG 
CCTTCGAAGT TGTTTTTGAG 
TATCAAATTT GGAAAAAGGG 



TGCGAAACGA AAAAACGGCG TGATTGGTTG 
CCGCGAAAGG GGAGCAGCTA CTTTTTAAAG 
CTTCTGGTGG GTCGCAAGAC GTTTGAATCT 
GAAATACGCG GTCGTTACCC GCTCAGGTTG 
TAGTTGTATT TCAGTCAATC GAAGAGGCCA 
ACCGGTCACG TTATAGTGTC TGGTGGCGGA 
ACCCATGGCC TCTACGCTCC ACTTATCGAC 
GGGATGTTTT CTTCCCGAGT ATTCCAAATA 
CAACACTTTA CTTCAAACAT TAACTATTGC 
TTAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
474 



2) INFORMATION FOR SEQ ID NO: 1471 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1471 
GATAATGACA ACGTAATAGT ATTCCC 2 



2) INFORMATION FOR SEQ ID NO: 1472 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1472 
GCTCAATATC AATCGTCGAT ATA 2 



2) INFORMATION FOR SEQ ID NO: 1473 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1473 
TTAAAGCCTT GACGTACAAC CAGTGG 



2) INFORMATION FOR SEQ ID NO: 1474 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1474 
TGGGCAATGT TTCTCTGTAA ATCTCC 



2) INFORMATION FOR SEQ ID NO: 1475 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X12868 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1475 

GTGAAAGTAT CATTAATGGC TGCAAAAGCG AAAAACGGAG TGATTGGTTG 50 

CGGTCCACAC ATACCCTGGT CCGCGAAAGG AGAGCAGCTA CTCTTTAAAG 100 

CCTTGACGTA CAACCAGTGG CTTTTGGTGG GCCGCAAGAC GTTCGAATCT 150 

ATGGGAGCAC TCCCTAATAG GAAATACGCG GTCGTTACTC GCTCAGCCTG 200 

GACGGCCGAT AATGACAACG TAATAGTATT CCCGTCGATC GAAGAGGCCA 250 

TGTACGGGCT GGCTGAACTC ACCGATCACG TTATAGTGTC TGGTGGCGGG 300 

GAGATTTACA GAGAAACATT GCCCATGGCC TCTACGCTCC ATATATCGAC 350 

GATTGATATT GAGCCGGAAG GAGATGTTTT CTTTCCGAAT ATTCCCAATA 400 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA GCTCAAACAT TAACTATTGC 4 50 

TATCAAATTT GGCAAAAGGG TTAA 4 74 
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2) INFORMATION FOR SEQ ID NO: 1476 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1476 
GGCGAGCAGC TCCTATTCAA AG 



2) INFORMATION FOR SEQ ID NO: 1477 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1477 
TAGGTAAGCT AATGCCGATT CAACA 



2) INFORMATION FOR SEQ ID NO: 1478 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1478 
GAGAATGGAG TAATTGGCTC TGGATT 



2) INFORMATION FOR SEQ ID NO: 1479 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

772 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



40 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1479 
GCGAAATACA CAACATCAGG GTCAT 25 

2) INFORMATION FOR SEQ ID NO: 1480 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: J120 

25 (C) ACCESSION NUMBER: Z86002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1480 

ATGAAAATAT CTCTTATGGC AGCTGTTTCC GAGAATGGAG TAATTGGCTC 50 

3 0 TGGATTGGAT ATACCTTGGC ATGTACAAGG CGAGCAGCTC CTATTCAAAG 100 

CCATGACTTA CAATCAATGG CTTCTAGTTG GTCGTAAAAC CTTCGACTCA 150 

ATGGGTAAAC TTCCGAATAG AAAATATGCA GTGGTTACTC GTTCTAAAAT 200 

TATCTCGAAT GACCCTGATG TTGTGTATTT CGCAAGTGTT GAATCGGCAT 250 

TAGCTTACCT AAACAATGCG ACAGCACATA TCTTTGTTTC TGGTGGTGGT 300 

35 GAAATATATA AAGCTTTAAT CGATCAAGCA GATGTTATCC ATCTTTCAGT 350 

GATTCACAAG CATATCTCTG GCGATGTGTT TTTTCCTCCA GTTCCACAGG 4 00 

GCTTCAAGCA AACATTTGAG CAAAGTTTCA GTTCAAATAT TGATTACACG 4 50 

TACCAAATTT GGGCAAAGGG CTAA 47 4 



2) INFORMATION FOR SEQ ID NO: 1481 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1481 
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5 2) INFORMATION FOR SEQ ID NO: 1482 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1482 

TAATTTATAT TAGACAWAAA AAACTG 26 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1483 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1483 
CARYGTCAGA AAATGGCGTA ATC 23 

2) INFORMATION FOR SEQ ID NO: 1484 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1484 
TKCAAAGCRW TTTCTATTGA AGGAAA 26 

50 

2) INFORMATION FOR SEQ ID NO: 1485 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1485 
AAAATGGCGT AATCGGTAAT GGC 



2) INFORMATION FOR SEQ ID NO: 1486 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1486 
CATTTGAGCT TGAAATTCCT TTCCTC 



2) INFORMATION FOR SEQ ID NO: 1487 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1487 
AATCGAAAAT ATGCAGTAGT GTCGAG 



2) INFORMATION FOR SEQ ID NO: 1488 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1488 

AGACTATTGT AGATTTGACC GCCA 24 



10 



20 



25 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1489 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: VA292 

(C) ACCESSION NUMBER: U31119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1489 



TTGAAAATTT CATTGATTTC TGCAACGTCA GAAAATGGCG TAATCGGTAA 50 

TGGCCCTGAT ATCCCATGGT CAGCAAAAGG TGAGCAGTTA CTCTTTAAAG 100 

30 CGCTCACATA TAATCAGTGG CTCCTTGTTG GAAGGAAAAC ATTTGACTCT 150 

ATGGGTGTTC TTCCAAATCG AAAATATGCA GTAGTGTCGA GGAAAGGAAT 200 

TTCAAGCTCA AATGAAAATG TATTAGTCTT TCCTTCAATA GAAATCGCTT 250 

TGCAAGAACT ATCGAAAATT ACAGATCATT TATATGTCTC TGGTGGCGGT 300 

CAAATCTACA ATAGTCTTAT TGAAAAAGCA GATATAATTC ATTTGTCTAC 350 

35 TGTTCACGTT GAGGTTGAAG GTGATATCAA TTTTCCTAAA ATTCCAGAGA 400 

ATTTCAATTT GGTTTTTGAG CAGTTTTTTT TGTCTAATAT AAATTACACA 450 

TATCAGATTT GGAAAAAAGG CTAA 47 4 



2) INFORMATION FOR SEQ ID NO: 1490 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1490 

GACCTATGAG AGCTTGCCCG TCAAA 25 

776 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1491 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1491 
TCGCCTTCGT ACAGTCGCTT AACAAA 



2) INFORMATION FOR SEQ ID NO: 1492 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1492 
CATTTTAGCT GCCACCGCCA ATGGTT 



2) INFORMATION FOR SEQ ID NO: 1493 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1493 
GCGTCGCTGA CGTTGTTCAC GAAGA 



2) INFORMATION FOR SEQ ID NO: 1494 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BL26A 

(C) ACCESSION NUMBER: U10186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1494 

ATGATCGAGC TTCATGCCAT TTTAGCTGCC ACCGCCAATG GTTGCATTGG 50 

GAAGGACAAC GCACTTCCCT GGCCACCACT AAAAGGCGAT CTGGCCAGAT 100 

TCAAAAAATT GACCATGGGG AAGGTGGTCA TTATGGGGCG CAAGACCTAT 150 

GAGAGCTTGC CCGTCAAATT AGAAGGTCGC ACCTGCATCG TTATGACGCG 200 

CCAAGCGCTG GAGCTTCCGG GTGTTCGTGA CGCTAACGGC GCTATCTTCG 250 

TGAACAACGT CAGCGACGCC ATGCGGTTCG CTCAAGAAGA GAGCGTGGGC 300 

GATGTGGCCT ACGTCATTGG TGGCGCTGAG ATATTCAAGC GACTTGCCTT 350 

GATGATCACG CAGATTGAAT TGACCTTTGT TAAGCGACTG TACGAAGGCG 4 00 

ACACCTACGT TGATCTGGCC GAAATGGTCA AAGACTACGA GCAGAATGGC 4 50 

ATGGAAGAAC ATGACCTTCA CACTTACTTC ACTTACCGTA AAAAGGAGCT 500 

TACAGAATGA 510 



2) INFORMATION FOR SEQ ID NO: 1495 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1495 
TCTCTAAACA TGATTGTCGC TGTC 



2) INFORMATION FOR SEQ ID NO: 1496 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1496 
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CAGTGAGGCA AAAGTTTTTC TACC 



2) INFORMATION FOR SEQ ID NO: 1497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 97 
CGGACGACTT CATGTGGTAG TCAGT 



2) INFORMATION FOR SEQ ID NO: 1498 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1498 
TTTGTTTTCA GTAATGGTCG GGACCT 



2) INFORMATION FOR SEQ ID NO: 14 99 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X57730 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:' 1499 
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ATGGCTTCTC TAAACATGAT 
ATTTGAAAAT CAGATTCCGT 
AAGCTGTTAC AATGAACTCA 
TCACTGCCTA AAGTGCTGCC 
5 AGTACCACCC ACCCAGAACA 
AGATCGCAGT AAGAACTGCA 
CAAATTTTTG TAATTGGTGG 
CGTGGACAAA CTCTACTTAA 
CTGAACTGGA TTTATCCCTA 
10 CCGACCATTA CTGAAAACAA 
CCCTAACCCT ATTAGTGAGG 



TGTCGCTGTC AATAAGACAG 
GGCATGAACC AGAAGATTTA 
GTTTTGATTA TGGGTAGAAA 
CGGACGACTT CATGTGGTAG 
CTGATCAAGT TGTGTATGTA 
AGCTTGTTGG TTGACAAACC 
GAAGAGTGCG TACGAGAACT 
CTAGAGTACA GCTCAACACA 
TTCAAGTCAT GGAAACTCGT 
AACAAAACTT ATTTTCCAAA 
AACCCACATG TTAG 



PCT/CA00/01150 

GAGGTATCGG 50 

AAACACTTCA 100 

AACTTTTGCC 150 

TCAGTAAAAC 200 

AGTACATACC 250 

AGAGTATTCT 300 

TAGCTGCCTA 350 

CAACAAGACA 400 

ATCTGAGGTC 4 50 

TTTGGATTAA 500 
534 



15 2) INFORMATION FOR SEQ ID NO: 1500 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1500 

ATCGGGTTAT TGGCAATGGT CCTA 24 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1501 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1501 
GCGGTAGTTA GCTTGGCGTG AGATT 25 

2) INFORMATION FOR SEQ ID NO: 1502 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1502 
GCGGGCGGAG CTGAGATATA CA 



2) INFORMATION FOR SEQ ID NO: 1503 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: * Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1503 
AACGGAGTGG GTGTACGGAA TTACAG 



2) INFORMATION FOR SEQ ID NO: 1504 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: TKS84 

(C) ACCESSION NUMBER: Z21672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1504 

ATGAACTCGG AATCAGTACG CATTTATCTC GTTGCTGCGA TGGGAGCCAA 50 

TCGGGTTATT GGCAATGGTC CTAATATCCC CTGGAAAATT CCGGGTGAGC 100 

AGAAGATTTT TCGCAGACTC ACTGAGGGAA AAGTCGTTGT CATGGGGCGA 150 

AAGACCTTTG AGTCTATCGG CAAGCCTCTA CCGAACCGTC ACACATTGGT 200 

AATCTCACGC CAAGCTAACT ACCGCGCCAC TGGCTGCGTA GTTGTTTCAA 250 

CGCTGTCGCA CGCTATCGCT TTGGCATCCG AACTCGGCAA TGAACTCTAC 300 

GTCGCGGGCG GAGCTGAGAT ATACACTCTG GCACTACCTC ACGCCCACGG 350 

CGTGTTTCTA TCTGAGGTAC ATCAAACCTT CGAGGGTGAC GCCTTCTTCC 400 

CAATGCTCAA CGAAACAGAA TTCGAGCTTG TCTCAACCGA AACCATTCAA 4 50 

GCTGTAATTC CGTACACCCA CTCCGTTTAT GCGCGTCGAA ACGGCTAA 4 98 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1505 
ATTTTTCGCA GGCTCACCGA GAGC 24 



2) INFORMATION FOR SEQ ID NO: 1506 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1506 
CGGATGAGAC AACCTCGAAT TCTGCTG 27 



2) INFORMATION FOR SEQ ID NO: 1507 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: RA33.2 

(C) ACCESSION NUMBER: Z50802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1507 

ATGAACCCGG AATCGGTCCG CATTTATCTG GTCGCTGCCA TGGGTGCCAA 50 

TCGGGTTATT GGCAATGGTC CCGATATCCC CTGGAAAATC CCAGGTGAGC 100 

AGAAGATTTT TCGCAGGCTC ACCGAGAGCA AAGTGGTCGT TATGGGCCGC 150 

AAGACATTTG AGTCCATAGG CAAGCCCTTA CCAAACCGCC ACACAGTGGT 200 
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GCTCTCGCGC CAAGCTGGTT ATAGCGCTCC TGGTTGTGCA GTTGTTTCAA 
CGCTGTCACA CGTATCGCCA TCGACAGCCG AACACGGCAA AGAACTCTAC 
GTAGCGCGCG GAGCCGAGGT ATATGCGCTG GCGCTACCGC ATGCCAACGG 
CGTCTTTCTA TCTGAGGTAC ATCAAACCTT TGAGGGTGAC GCCTTCTTCC 
CAGTGCTTAA CGCAGCAGAA TTCGAGGTTG TCTCATCCGA AACCATTCAA 
GGCACAATCA CGTACACGCA CTCCGTCTAT GCGCGTCGTA ACGGCTAA 



250 
300 
350 
400 
450 
498 



2) INFORMATION FOR SEQ ID NO: 1508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1508 
AGAATGTATT GGTATTTCCA TCTATCG 2 



2) INFORMATION FOR SEQ ID NO: 1509 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1509 
CAATGTCGAT TGTTGAAATA TGTAAA 2 



2) INFORMATION FOR SEQ ID NO: 1510 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1510 
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2) INFORMATION FOR SEQ ID NO: 1511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1511 
CAGACACAAT CACATGATCC GTTATCG 



2) INFORMATION FOR SEQ ID NO: 1512 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: UI14 

(C) ACCESSION NUMBER: Z83331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1512 

GTGAAACTAT CACTAATGGC AGCAATTTCG AAGAATGGAG TTATCGGAAA 50 

TGGCCCAGAT ATTCCATGGA GTGCCAAAGG GGAACAATTA CTCTTCAAAG 100 

CGATTACCTA TAATCAGTGG CTTTTGGTAG GCCGAAAGAC TTTCGAGTCA 150 

ATGGGGGCTT TACCCAACCG AAAATATGCC GTTGTAACTC GTTCAAGCTT 200 

CACTTCCAGT GATGAGAATG TATTGGTATT TCCATCTATC GATGAAGCGC 250 

TAAATCATCT GAAGACGATA ACGGATCATG TGATTGTGTC TGGTGGTGGT 300 

GAAATATACA AAAGCCTGAT CGATAAAGTT GATACTTTAC ATATTTCAAC 350 

AATCGACATT GAGCCAGAAG GTGATGTCTA TTTTCCAGAA ATCCCCAGTA 400 

GTTTTAGGCC AGTTTTTAGC CAAGACTTCG TGTCTAACAT AAATTATAGT 4 50 

TACCAAATCT GGCAAAAGGG TTAA 474 



2) INFORMATION FOR SEQ ID NO: 1513 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1513 
10 TTCAAGCTCA AATGAAAACG TCC 23 



15 



2) INFORMATION FOR' SEQ ID NO: 1514 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1514 

25 

GAAATTCTCA GGCATTATAG GGAAT 25 



30 2) INFORMATION FOR SEQ ID NO: 1515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1515 

GTGGTCAGTA AAAGGTGAGC AAC 23 



45 

2) INFORMATION FOR SEQ ID NO: 1516 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 



785 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



PAGE. 



MISSING AT THE TIME OF PUBLICATION 



i 



i 
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CTATGTCTCA AGGCCGTGCA ACATACTCTA TGGAATTTGC TAAATATGCT 
GAAACTCCAC GTAACGTGGC TGAAGGCATC ATTTCTAAAT TTCAGTCTGG 
CGGTAAAAAA GGTGACGACG AGTAA 



50 
100 
125 



2) INFORMATION FOR SEQ ID NO: 1519 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AciiietoJbacter Iwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1519 

TCTTTCGATT ACTATAAGCC CAAACTAATT CATAGTTAAA AACCAAGTGC 50 
TCATGCAGTG ATCCTGCATG AGTAGTTTAA AAAGGAAGAT CTC 93 



2) INFORMATION FOR SEQ ID NO: 1520 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter Iwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1520 

ATGGCTAAGG CTAAGTTTGA ACGTAATAAG CCACACGTTA ACGTGGGCAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCAATTGCAA 100 

CTGTATGTGC GAAGAAATTC GGTGGCGAAG CGAAAGACTA CGCTGCAATT 150 

GACTCTGCAC CAGAAGAAAA AGCACGTGGT ATTACAATTA ATACTTCACA 200 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTA GACTGCCCGG 250 

GCCACGCCGA TTATGTTAAA AACATGATTA CTGGTGCTGC TCAGATGGAC 300 

GGCGCGATCC TTGTATGTGC TGCGACTGAT GGTCCAATGC CACAGACTCG 350 

TGAACACATC CTTCTTTCTC GTCAGGTTGG TGTACCTTAC ATTCTTGTAT 400 

TCCTTAACAA GTGTGACCTT GTTGATGATG AAGAACTTCT TGAGCTAGTG 4 50 
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GAAATGGAAG TTCGTGAACT 
CACTCCAGTT ATCCGTGGTT 
GTCAGTATGG CGAAGAAGCA 
TACATTCCAG AGCCAGTACG 
CGAAGACGTA TTCTCTATTT 
TAGAAACTGG TATTGTGAAA 
CGTGATACTC AAGTAACTAC 
GCTTGACGAA GGTCGTGCGG 
CTAAGCGTGA AGACGTACAA 
ATCAAGCCAC ACACTAAATT 
AGAAGGTGGT CGTCACACTC 
ACTTCCGTAC AACTGACGTA 
GAAATGGTTA TGCCTGGTGA 
CCCAAT 



TCTTTCTACT TATGACTTCC 
CAGCTCTTCT TGCACTTAAC 
GTTGTTGCGC TTGTTGACGC 
TGCAATCGAC CAAGCATTCT 
CTGGTCGTGG TACAGTAGTA 
GTAGGCGAAT CAGTTGAAAT 
AGTTACTGGC GTAGAAATGT 
GCGAGAACTG TGGTGTTCTT 
CGTGGTCAAG TACTTGCTAA 
CGATGCAGAA GTATACGTAC 
CATTCCTTAA CGGTTACCGT 
ACTGGCGCGA TCAAATTACA 
CAACGTAGAA ATGTCAGTAG 



CAGGTGATGA 500 

GGTGACGCTG 550 

ACTTGACACT 600 

TAATGCCAAT 650 

ACTGGCCGTG 700 

CGTTGGTATC 750 

TCCGTAAATT 800 

CTACGTGGTA 8 50 

ACCAGGTGCA 900 

TTTCTAAAGA 950 

CCACAGTTCT 1000 

AGATGGCGTT 1050 

AATTAATCCA 1100 
1106 



2) INFORMATION FOR SEQ ID NO: 1521 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1521 

ACAAACTCAA GGTCGTGCAT CTTACTCAAT GGAACCGTTA AAATATGCTG 50 
AAGCTCCAAC AAGTGTTGCG GCTGCAGTAA TTGAAGCGCG TAAAAAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1522 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1522 
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TTTTTGTAAA CCAGCGGTGT AAAATATGAT TGTTTTATAC CGCACTTCTT 
AGGAAACATT AGAA 



50 
64 



2) INFORMATION FOR SEQ ID NO: 1523 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1523 

ATGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTAA ACGTGGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC TTTAACAGCA GCAATTACAA 100 

CCGTATTAGC AAAACACTAC GGTGGTGCAG CGCGTGCATT TGACCAAATC 150 

GATAACGCGC CAGAAGAAAA AGCGCGTGGT ATTACCATCA ACACTTCACA 200 

TGTTGAATAC GATACACCAA CTCGCCACTA TGCACACGTA GACTGTCCAG 250 

GACACGCCGA CTATGTTAAA AACATGATTA CCGGTGCGGC GCAAATGGAT 300 

GGTGCTATTT TAGTAGTAGC AGCAACAGAT GGTCCTATGC CACAAACTCG 350 

TGAACATATC TTATTAGGTC GCCAAGTAGG TGTTCCATAC ATCATCGTAT 400 

TCTTAAACAA ATGCGACATG GTAGATGATG AAGAGTTATT AGAATTAGTA 450 

GAAATGGAAG TGCGTGAACT TCTATCTCAA TATGACTTCC CAGGTGACGA 500 

TACACCAATC GTACGTGGTT CAGCATTACA AGCATTGAAC GGCGTAGCAG 550 

AATGGGAAGA AAAAATCCTT GAATTAGCTG GTCACTTAGA TACTTACATC 600 

CCAGAACCAG AACGTGCGAT TGACCAACCG TTCCTTCTTC CAATTGAAGA 650 

CGTATTCTCA ATTTCAGGTC GTGGTACAGT AGTAACTGGT CGTGTAGAAC 7 00 

GTGGTATCAT CCGTACTGGT GATGAAGTAG AAATCGTCGG TATCAAAGAT 750 

ACAGCGAAAA CTACTGTAAC AGGTGTTGAA ATGTTCCGTA AATTACTTGA 800 

CGAAGGTCGT GCAGGTGAAA ACATCGGTGC ATTATTACGT GGTACCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAAGTATTAG CGAAACCAGG TTCAATCACA 900 

CCACACACTG ATTTTGAATC AGAAGTATAC GTATTATCAA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAA TTCTATTTCC 1000 

GTACAACAGA CGTAACTGGT ACAATTGAAT TACCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAGATGACA GTAAGCTTAA TCCACCCA 1098 



2) INFORMATION FOR SEQ ID NO: 1524 

(i) SEQUENCE CHARACTERISTICS : , 

(A) LENGTH: 77 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1524 

CAATGGAGTT CTTGAAGTAC AACGAAGCGC CTAGCAACGT CGCTCAGGCT 50 
ATTATCGAAG CTCGTAAAGC GAAATAA 77 



15 



25 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1525 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 67 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1525 

GATCCTTTCG AGTTCAATTT AGTTTACGCT CCCTCTGTGA GAGGGAGCGA 50 
TATTAAGGAA TATAGTC 67 



2) INFORMATION FOR SEQ ID NO: 1526 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1112 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1526 

GTGTCTAAAG AAAAATTTGA ACGTTCAAAA CCGCACGTTA ACGTTGGTAC 50 
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TATCGGCCAC GTTGACCACG 
CAGTTTTAGC TAAAACTTAC 
GATAATGCAC CAGAAGAAAA 
CGTAGAATAC GATACTCCAA 
5 GTCACGCCGA CTATGTTAAA 
GGCGCTATTC TGGTAGTAGC 
TGAGCACATC CTGTTAGGTC 
TCCTGAACAA ATGTGACATG 
GAAATGGAAG TTCGTGAACT 

10 CACTCCAGTA ATCCGTGGTT 
AGTGGGAAGC AAAAATTGTT 
CCAGAGCCAG AGCGTGCAAT 
TGTATTCTCA ATCTCAGGCC 
GTGGTATCAT CAAAGTAGGT 

15 ACCACCAAAA CAACTTGTAC 
CGAAGGTCGT GCAGGTGAGA 
GTGAAGAAAT CGAACGTGGA 
CCACACAACA AATTTGAATC 
TGGTCGTCAC ACACCATTCT 

20 GTACAACTGA CGTAACTGGT 
GTAATGCCAG GCGACAACGT 
CGCAATGGAC GA 



GTAAAACAAC TCTGACTGCT 
GGTGGTGCTG CTCGTGCATT 
AGCGCGTGGT ATCACCATCT 
CTCGCCACTA CGCACACGTA 
AACATGATCA CTGGTGCTGC 
AGCAACTGAT GGTCCAATGC 
GTCAGGTTGG TGTTCCTTAC 
GTAGATGATG AAGAGCTGTT 
TCTGTCTCAA TACGATTTCC 
CAGCGCTGAA AGCACTGGAA 
GAATTAGCAG AAGCACTGGA 
TGACAAACCA TTCCTGTTAC 
GTGGTACAGT AGTTACTGGT 
GATGAAGTTG' AGATTGTTGG 
TGGCGTTGAA ATGTTCCGTA 
ACGTAGGTGT TCTGCTGCGT 
CAAGTACTGG CAAAACCAGG 
AGAAGTTTAT ATTCTGAGCA 
TCAAAGGCTA CCGTCCACAG 
ACTATCGAAT TACCAGAAGG 
GAACATGATC GTTGAACTGA 



PCT/CAOO/01150 

GCAATCACTA 100 

CGACCAAATC 150 

CTACTTCACA 200 

GACTGCCCAG 250 

GCAAATGGAC 300 

CACAAACTCG 350 

ATCATCGTAT 400 

AGAATTAGTT 450 

CAGGTGATGA 500 

GGCGAAGCAG 550 

TTCTTATATC 600 

CAATCGAAGA 650 

CGTGTAGAGC 700 

TATCAAAGAA 750 

AATTACTTGA 800 

GGTACAAAAC 850 

CTCAATCAAC 900 

AAGATGAAGG 950 

TTCTACTTCC 1000 

CGTAGAAATG 1050 

TCCACCCAAT 1100 
1112 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1527 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1527 

ATCAACGAAG CTATCGAGGT TTATTTTGAG GTTGAGGGCA AGAAAAATAG 50 

ATTGATCCTG GAGGTCGCGG CTCACTTGGG TGATAACCGC GTCAGAACGA 100 

TCGCTATGGA TATGAGTGAG GGGCTTACTC GCGGGCTTGA AGCTACCGCT 150 

45 CTTGGTGCGC CTATTAGTGT GCCGGTTGGC GAGAAGGTTT TGGGAAGAAT 200 

TTTTAACGTC GTCGGCGATC TCATCGACGA GGGCGAGGGC GTAAATTTTG 250 

ATAAACATTG GTCTATCCAC CGCGATCCGC CACCATTTGA AGAACAAAGC 300 

ACGAAAAGTG AAATTTTTGA AACCGGTATA AAGGTTGTGG ATCTTCTTGC 350 

GCCTTACGCA AAGGGCGGTA AGGTCGGACT ATTTGGCGGT GCAGGTGTCG 400 

50 GTAAAACGGT CATCATCATG GAGCTCATCC ACAATGTCGC CTTCAAACAC 450 

AGCGGATACT CTGTATTTGC AGGTGTTGGC GAGAGGACGC GCGAAGGAAA 500 

CGACCTTTAT CACGAGATGA AAGAAAGTAA CGTTTTGGAT AAAGTCGCCT 550 

TGTGCTACGG ACAGATGAAC GAGCCGCCAG GGGCGAGAAA TCGTATCGCA 600 
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CTGACTGGTC TAACGATGGC TGAGTATTTC 
TGTGCTTATG TTTATCGACA ACATCTTCCG 
AGATGTCGGC ACTCCTCGGA CGTATCCCAT 
ACGCTGGCAA GCGAGATGGG TAAATTTCAA 
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CGCGATGAGA TGGGACTTGA 650 

CTTCTCTCAA TCTGGTGCAG 700 

CAGCCGTTGG TTACCAGCCG 750 

GAAAGGATCA CATCGACTAA 800 



2) INFORMATION FOR SEQ ID NO: 1528 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE:, DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1528 
20 AACTTGAGCG ATTTTCGGAT ACCCTG 26 



2) INFORMATION FOR SEQ ID NO: 1529 



25 



(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1529 

35 

TTGCCGATGA AATAACCGCC GACT 24 



40 2) INFORMATION FOR SEQ ID NO: 1530 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: M11277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1530 



ATGCGATTGG TTTGGAAATG TGGGGCGATT CAGGCATCCC GGTTATCTGA 50 

ATGGCTCAAC TCAACAGCCG GTGCTCATGA ACTTGAGCGA TTTTCGGATA 100 

5 CCCTGACCTT TTCTGTGTAT GGCTCAGTGC TGATCTGGCT GAAATCATAT 150 

CTCCGCGAAT CAGGAAGAAA ACTGCAGTTA GTCGGAATCG CCTTACCCAA 200 

CACCCTGAAC CCAAGGGACG ACCTAGCGCA ATTGGCCGAA ATTATCCAGC 250 

TCATCGATCA CCTCATGAAA CCGCACGTTG ATATGTTGAC TCACTTGTTG 300 

GCGTCCATTG ATGGCCAGTC GGCGGTTATT TCATCGGCAA AATGGGGGGA 350 

10 GCTAGAAACG GCTCGGCAGG AGAAAGCTAT CTCAGGGGTA ACCAGATTGA 400 

AGCTCCGCTT GGCGTCGCTT GCCCCCGTCC TGAAAAAACA CGTCAACAGC 4 50 

GATTTGTTCC GAAAAGCCTC TGATCGAATA GAGTCGATAG AGTATACGTT 500 

GGAAACCTTG CGTATAATGA AAACTTTCTT CGATGGTACC TCTCTTGAGG 550 

GAGATACTTC CGTACGTGAC TCGTATATGG CGGGCGTAGT AGATGGAATG 600 

15 GTTCGAGCGA ATCCGGATGT GAAGATAATT CTGCTGGCGC ACAACAATCA 650 

TCTACAAAAA ACTCCAGTCT CCTTTTCAGG CGAGCTTACG GCTGTTCCCA 700 

TGGGGCAGCA CCTCGCAGAG AGGGTGAATT ACCGTGCGAT TGCATTCACC 750 

CATCTTGGAC CCACCGTGCC GGAAATGCAT TTCCCATCGC CAAAAAGTCC 800 

TCTTGGATTC TCTGTTGTGA CCACGCCTGC CGATGCAATC CGTGAGGATA 850 

20 GTATGGAACA GTATGTCATC GACGCCTGTG GTACGGAGAA TTCATGTCTG 900 

ACATTGACAG ATGCCCCCAT GGAAGCAAAG CGAATGCGGT CTCAAAGCGC 950 

CTCTGTAGAA ACGAAATTGA GCGAGGCATT TGATGCCATC GTCTGTGTTA 1000 

CAAGCGCCGG CAAGGACAGC CTGGTTGCCC TAT AG 1035 



25 



2) INFORMATION FOR SEQ ID NO: 1531 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1531 
TCTTTTTGTT ACGACATACG CTTTT 25 

40 



2) INFORMATION FOR SEQ ID NO: 1532 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1532 

793 



SUBSTITUTE SHEET (RULE 26) 



2oiTov wwfeo ■ 1 1 .00) 

AGTGCTTCTT TATCCGCTGT TCTA 24 



2) INFORMATION FOR SEQ ID NO: 1533 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



15 



20 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1533 
CAGCGGATAA AGAAGCACTA CACATT 26 

2) INFORMATION FOR SEQ ID NO: 1534 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1534 
CCTCCTGAAA TAAAGCCCGA CAT 23 

35 



2) INFORMATION FOR SEQ ID NO: 1535 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
50 (C) ACCESSION NUMBER: A15097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1535 
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ATGAGGTTCG 


AAGAATGGGT 


CAAAGATAAG 


CATATTCCTT 


NCAAACNGAA 


50 




TCACCCTGAT 


GATAATTACG 


ATGATTTTAA 


GCCATTAAGA 


AANATAATTG 


100 




GAGATACCCG 


AGTTGTAGCA 


TTAGGTGAAA 


ATTCTCATTT 


CATAAAAGAA 


150 




TTCTTTTTGT 


TACGACATAC 


GCTTTTGCGT 


TTTTTTATCG 


AAGATCTAGG 


200 


5' 


TTTTACTACG 


TTTGCTTTTG 


AATTTGGTTT 


TGCTGAGGGT 


CAAATCATCA 


250 




ATAACTGGAT 


ACATGGACAA 


GGAACTGACG 


ATGAAATAGG 


CAGATTCTTA 


300 




AAACACTTCT 


ATTATCCAGA 


AGAGCTCAAA 


ACCACATTTC 


TATGGCTAAG 


350 




GGAGTACAAT 


AAAGCAGCAA 


AAGAAAAAAT 


CACATTTCTT 


GGCATTGATA 


400 




TACCCAGAAA 


TGGAGGTTCA 


TACTTACCAA 


ATATGGAGAT 


AGTGCATGAC 


450 


10 


TTTTTTAGAA 


CAGCGGATAA 


AGAAGCACTA 


CACATTATCG 


ATGATGCATT 


500 




TAATATTGCA 


AAAAAGATTG 


ATTACTTCTC 


CACATCACAG 


GCAGCCTTAA 


550 




ATTTACATGA 


GCTAACAGAT 


TCTGAGAAAT 


GCCGTTTAAC 


TAGCCAATTA . 


600 




GCTCGAGTAA 


AAGTTCGCCT 


TGAAGCTATG 


GCTCCAATTC 


ACATTGAAAA 


650 




ATATGGGATT 


GATAAATATG 


AGACAATTCT 


GCATTATGCC 


AACGGTATGA 


700 


15 


TATACTTGGA 


CTATAACATT 


CAAGCTATGT 


CGGGCTTTAT 


TTCAGGAGGC 


750 




GGAATGCAGG 


GCGATATGGG 


TGCAAAAGAC 


AAATACATGG 


CAGATTCTGT 


800 




GCTGTGGCAT 


TTAAAAAACC 


CACAAAGTGA 


GCAGAAAGTG 


ATAGTAGTAG 


850 




CACATAATGC 


ACATATTCAA 


AAAACACCCA 


TTCTGTATGA 


TGGATTTCTA 


900 




AGTTGCCTAC 


CAATGGGCCA 


AAGACTTAAA 


AATGCCATTG 


GTGATGATTA 


950 


20 


TATGTCTTTA 


GGTATTACTT 


CTTATAGTGG 


GCATACTGCA 


GCCCTCTATC 


1000 




CGGAAGTTGA 


TACAAAATAT 


GGTTTTCGAG 


TTGATAACTT 


CCAACTGCAG 


1050 




GAACCAAATG 


AAGGTTCTGT 


CGAGAAAGCT 


ATTTCTGGTT 


GTGGAGTTAC 


1100 




TAATTCTTTT 


GTCTTTTTTA 


GAAATATTCC 


TGAAGATTTA 


CAATCCATCC 


1150 




CGAACATGAT 


TCGATTTGAN 


TCTATTTACA 


TGAAAGCAGA 


ACTCGAGAAA 


1200 


25 


GCTTTCGATG 
TTATGAATAA 


GAATATTTCA 


AATTGAAAAG 


TCATCTGTAT 


CTGAGGTCGT 


1250 
1260 



30 2) INFORMATION FOR SEQ ID NO: 1536 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1536 

AGATGTATTA ACTGGAAAAC AACAA 25 



45 

2) INFORMATION FOR SEQ ID NO: 1537 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1537 
CTTTGTAATT AGTTTCTGAA AACCA 



2) INFORMATION FOR SEQ ID NO: 1538 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single' 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1538 
TTAGAAGATA TAGGATACAA AATAGAAG 



2) INFORMATION FOR SEQ ID NO: 1539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1539 
GAATGAAAAA GAAGTTGAGC TT 



2) INFORMATION FOR SEQ ID NO: 1540 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
(C) ACCESSION NUMBER: M14039 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1540 

ATGAAAAATA ATAATGTAAC AGAAAAAGAA TTATTTTATA TTTTAGATTT 50 

5 ATTTGAACAC ATGAAAGTAA CTTATTGGTT AGATGGTGGC TGGGGGGTAG 100 

ATGTATTAAC TGGAAAACAA CAAAGAGAAC ACAGAGATAT AGATATAGAT 150 

TTTGACGCTC AACACACTCA AAAAGTTATA CAAAAATTAG AAGATATAGG 200 

ATACAAAATA GAAGTTCATT GGATGCCTTC ACGTATGGAA CTTAAGCATG 250 

AAGAATATGG GTATTTAGAT ATTCATCCTA TAAATCTAAA TGATGATGGA 300 

10 TCAATTACCC AAGCAAACCC AGAAGGTGGT AATTATGTTT TCCAAAATGA 350 

CTGGTTTTCA GAAACTAATT ACAAAGATCG AAAAATACCA TGTATTTCAA 400 

AAGAAGCTCA ACTTCTTTTT CATTCTGGTT ATGATTTAAC AGAAACAGAC 450 

CATTTTGATA TAAAAAATTT AAAATCAATA ACATAA 486 



15 



50 



2 ) INFORMATION FOR SEQ ID NO: 1541 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1541 
■ TGATAATCTT ATACGTGGGG AATTT 25 

30 

2) INFORMATION FOR SEQ ID NO: 1542 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid. 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1542 
45 ATAATTTTCT AATTGCCCTG TTTCAT 26 



2) INFORMATION FOR SEQ ID NO: 154 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

797 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1543 
GGGCAATTAG AAAATTATTT ATCAGA 



2) INFORMATION FOR SEQ ID NO: 1544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1544 
TTTTACTCAT GTTTAGCCAA TTATCA 



2) INFORMATION FOR SEQ ID NO: 1545 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
(C) ACCESSION NUMBER: AF110130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1545 

ATGTTAAAAC AAAAAGAATT AATTGCAAAC GTTAAGAATC TTACTGAGTC 50 

AGATGAACGA ATTACAGCTT GTATGATGTA TGGATCGTTT ACCAAAGGAG 100 

AAGGTGACCA ATACTCTGAT ATAGAGTTCT ATATATTTTT GAAACATAGT 150 

ATAACCTCGA ACTTTGATTC ATCCAACTGG TTGTTTGACG TAGCTCCGTA 200 

CTTGATGCTT TATAAAAATG AGTACGGAAC AGAGGTAGTT ATTTTTGATA 250 

ATCTTATACG TGGGGAATTT CATTTCCTTT CTGAAAAAGA TATGAACATA 300 

ATCCCCTCGT TTAAAGATTC AGGTTATATT CCTGATACGA AGGCTATGCT 350 

TATTTACGAT GAAACAGGGC AATTAGAAAA TTATTTATCA GAGATAAGTG 400 

GTGCAAGACC AAATAGACTT ACTGAAGAAA ATGCTAATTT TTTGTTGTGT 4 50 

AATTTCTCTA ATCTATGGTT GATGGGAATC AACGTTCTAA AAAGAGGAGA 500 

798 
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ATATGCTCGT TCATTAGAAC TCTTATCACA 
AACTTATACG TATGGCAGAA AAAAATGCTG 
AAAAACCTTG AAAAAGAAAT TAGCCTTGAA 
GACCACTGCT CGATTAGATA AGGTAGAATT 
CTTTGCTATT AGTTATGGAT TTGCAAAGTC 
TTAAAAGTTA CACATGACAT TTTAGAAAGA 
ATAG 
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ACTTCAAAAA AATACACTAC 550 

ATAATTGGCT AAACATGAGT 600 

AATTATAAAA AATTTGCAAA 650 

ATTTGAAGCC TATAAAAATT 700 

ACCTTATTGA ACAATACAAC 750 

TTGTTGAATT ACATTAGTGA 800 

804 



2) INFORMATION FOR SEQ ID NO: 154 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1546 
CAAGAAGGAA TGGCTGTACT AC 22 



2) INFORMATION FOR SEQ ID NO: 1547 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1547 
TAATTCCCAA ATAACCCTAA TAATAGA 27 



2) INFORMATION FOR SEQ ID NO: 1548 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: U70055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1548 

5 

ATGGAAAAAT ACAACAATTG GAAACTTAAG TTTTATACAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CTTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACT GGATCCGCGA TGGTCTTGTC TATGGCTTCA 150 

CTATTAGGTT TTTTACCCTA TGCGGTCTTT GGACCTGCAA TTGGTGTGCT 200 

10 AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCT GATTTAATTA 250 

TCGCAGCAGC TGGTTCGGTG CTTACTATTG TTGCATTCTA TATGGAGCTA 300 

CCTGTCTGGA TGGTTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCACACC CCGGCTCTCA ATGCGGTTAC, GCCACTTTTA GTACCAGAAG 4 00 

AACAGCTTAC GAAATGTGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 4 50 

15 TATATTGTTA GTCCGGCGGT TGCAGCACTC TTATACTCCG TTTGGGAACT 500 

AAATGCTATT ATTGCCATCG ATGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATT CCTAAGCTGG GTGATCGCGT GCAAAGTTTG 600 

GACCCAAATT TCATAAGAGA AATGCAAGAA GGAATGGCTG TACTACGGCA 650 

AAATAAAGGA TTATTTGCTT TATTACTCGT TGGAACATTA TATATGTTTG 700 

2 0 TTTATATGCC AATTAATGCA CTATTCCCTT TAATTAGCAT GGATTACTTT 750 

AATGGAACAC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATCTGG 800 

AATGTTGATA GGGGGTCTAT TATTAGGGTT ATTTGGGAAT TACCAAAAGC 850 

GAATCTTATT AATAACGGCA TCCATTTTTA TGATGGGGAT AAGCTTAACC 900 

ATTTCAGGAT TACTTCCCCA AAGTGGATTT TTCATTTTTG TAGTCTGCTG 950 

25 TGCAATAATG GGGCTTTCTG TTCCGTTTTA CAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTA 1050 

ACTGGAAGTA TCATGTCTCT TGCTATGCCA ATTGGATTAA TTCTTTCTGC 1100 

ACTCTTTGCT GATAGAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

CTTTAATTAT TTGCATTGCA ATAGTTTGCC CAATGATAAA TGAGATTAGA 1200 

30 AAATTAGATT TAAAATAA 1218 



35 



2) INFORMATION FOR SEQ ID NO: 1549 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1549 

45 

GCTTATTATT AGGAAGATTA GGGGGC 26 



50 2) INFORMATION FOR SEQ ID NO: 1550 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1550 
TAGCAAGTGA CATGATACTT CCGA 



2) INFORMATION FOR SEQ ID NO: 1551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
(C) ACCESSION NUMBER: U83667 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1551 

ATGGAAAAAT ACAACAATTG GAAACGAAAA TTTTATGCAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CCTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACA GGATCTGCGA TGGTCTTGTC TATGGCTTCA 150 

TTAGTAGGTT TTTTACCCTA TGCGATTTTG GGACCTGCCA TTGGTGTGCT 200 

AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCC GATTTAATTA 250 

TCGCAGCAGC TGGTGCAGTG CTTGCTATTG TTGCATTCTG TATGGAGCTA 300 

CCTGTCTGGA TGATTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCATACC CCAGCACTCA ATGCGGTTAC ACCACTTTTA GTACCAGAAG 400 

AACAGCTAAC GAAATGCGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 450 

TATATTGTTA GTCCGGCAGT TGCAGCACTC TTATACTCCG TTTGGGATTT 500 

AAATGCTATT ATTGCCATCG ACGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATA CCTAAGCTGG GTAATCAAGT GCAAAGTTTA 600 

GAACCAAATT TCATAAGGGA GATGAAAGAA GGAGTTGTGG TTCTGAGACA 650 

AAACAAAGGA TTGTTTGCCT TATTACTCTT AGGAACACTA TATACTTTTG 700 

TTTATATGCC AATCAATGCA CTATTTCCTT TAATAAGCAT GGAACACTTT 750 

AATGGAACGC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATTTGG 800 

GATGCTAGCA GGAGGCTTAT TATTAGGAAG ATTAGGGGGC TTCGAAAAGC 850 

ATGTATTACT AATAACAAGT TCATTTTTTA TAATGGGGAC CAGTTTAGCC 900 

GTTTCGGGAA TACTTCCTCC AAATGGATTT GTAATATTCG TAGTTTGCTG 950 

TGCAATAATG GGGCTTTCGG TGCCATTTTA TAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTG 1050 

ATCGGAAGTA TCATGTCACT TGCTATGCCA ATTGGGTTAA TTCTTTCTGG 1100 

ATTCTTTGCT GATAAAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

TTTTAATTAT TGGCATTGCT ATAGTTTGCC AAATGATAAC TGAGGTTAGA 1200 

AAATTAGATT TAAAATAA 1218 
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2) INFORMATION FOR SEQ ID NO: 1552 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1552 
GGCAAGCAGT ATCATTAATC ACTA 



2) INFORMATION FOR SEQ ID NO: 1553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE -TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1553 
CAATGCTACG GATAAACAAT ACTATC 



2) INFORMATION FOR SEQ ID NO: 1554 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1554 
AGAAAATTAA GCCTGAATAT TTAGGAC 



2) INFORMATION FOR SEQ ID NO: 1555 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1555 
TAGTAAAAAC CAATGATTTA CACCG 25 



2) INFORMATION FOR SEQ ID NO: 1556 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1556 

ACTGTACGCA CTTGCAGCCC GACAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1557 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1557 
4 0 GAACGGCAGG CGATTCTTGA GCAT 24 



2) INFORMATION FOR SEQ ID NO: 1558 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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GTGGTGGTGC ATGGCGATCT CT 
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2) INFORMATION FOR SEQ ID NO: 1559 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1559 
GCCGCAGCGA GGTACTCTTC GTTA 



2) INFORMATION FOR SEQ ID NO: 1560 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: D16251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1560 

ATGACCGTAG TCACGACCGC CGATACCTCC CAACTGTACG CACTTGCAGC 50 

CCGACATGGG CTCAAGCTCC ATGGCCCGCT GACTGTCAAT GAGCTTGGGC 100 

TCGACTATAG GATCGTGATC GCCACCGTCG ACGATGGACG TCGGTGGGTG 150 

CTGCGCATCC CGCGCCGAGC CGAGGTAAGC GCGAAGGTCG AACCAGAGGC 200 

GCGGGTGCTG GCAATGCTCA AGAATCGCCT GCCGTTCGCG GTGCCGGACT 250 

GGCGCGTGGC CAACGCCGAG CTCGTTGCCT ATCCCATGCT CGAAGACTCG 300 

ACTGCGATGG TCATCCAGCC TGGTTCGTCC ACGCCCGACT GGGTCGTGCC 350 

GCAGGACTCG GAGGTCTTCG CGGAGAGCTT CGCGACCGCG CTCGCCGCCC 4 00 

TGCATGCCGT CCCCATTTCC GCCGCCGTGG ATGCGGGGAT GCTCATCCGT 450 

ACACCGACGC AGGCCCGTCA GAAGGTGGCC GACGACGTTG ACCGCGTCCG 500 

ACGCGAGTTC GTGGTGAACG ACAAGCGCCT CCACCGGTGG CAGCGCTGGC 550 

TCGACGACGA TTCGTCGTGG CCAGATTTCT CCGTGGTGGT GCATGGCGAT 600 

CTCTACGTGG GCCATGTGCT CATCGACAAC ACGGAGCGCG TCAGCGGGAT 650 

GATCGACTGG AGCGAGGCCC GCGTTGATGA CCCTGCCATC GACATGGCCG 700 

CGCACCTTAT GGTCTTTGGT GAAGAGGGGC TCGCGAAGCT CCTCCTCACG 750 
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TATGAAGCGG CCGGTGGCCG GGTGTGGCCG 
GGAGCGCCTT GCGTTCGGGG CGGTCACCTA 
CGGGTAACGA AGAGTACCTC GCTGCGGCGA 
GAATGA 



CGGCTCGCCC ACCACATCGC 800 
CGCACTCTTC GCCCTCGACT 850 
AGGCGCAGCT CGCCGCAGCG 900 

906 



2) INFORMATION FOR SEQ ID NO: 1561 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1048 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE :• Genomic DNA 



(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Candida albicans 
20 (B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1561 

GATCATGGTA AAACTACATT GACTGCTGCT ATCACCAAAG TTTTAGCCGA 50 

25 ACAAGGTGGT GCCAACTTCT TGGATTAYGG TTCTATTGAT AGAGCTCCAG 100 

AAGAAAGAGC TAGAGGTATC ACTATTTCCA CTGCCCACGT TGAATACGAA 150 

ACCAAGAACA GACACTATGC CCACGTTGAT TGTCCAGGAC ACGCTGATTA 200 

TATCAAAAAT ATGATTACTG GTGCCGCTCA AATGGATGGT GCTATCATTG 250 

TTGTTGCTGC CACTGATGGT CAAATGCCTC AAACCAGAGA ACATTTGTTA 300 

30 TTGGCCAGAC AAGTTGGTGT TCAAGACTTG GTTGTGTTTG TCAACAAAGT 350 

CGATACTATT GATGACCCTG AAATGTTGGA ATTAGTCGAA ATGGAAATGA 400 

GAGAATTGTT ATCCACCTAC GGTTTTGATG GTGACAACAC TCCAGTTATT 450 

ATGGGATCTG CTTTAATGGC TTTGGAAGAC AAGAAACCAG AAATTGGTAA 500 

GGAAGCTATC TTGAAATTGT TAGATGCTGT CGATGAACAC ATTCCAACTC 550 

35 CATCAAGAGA CTTGGAACAA CCATTTTTGT TACCAGTTGA AGACGTGTTC 600 

TCCATCTCCG GTAGAGGAAC TGTTGTCACT GGTAGAGTTG AAAGAGGTGT 650 

TTTGAAGAAG GGTGAAGAAA TCGAAATTGT TGGTGGTTTT GACAAACCTT 700 

ACAAGACTAC TGTTACCGGT ATTGAAATGT TCAAAAAAGA ATTAGACTCT 750 

GCTATGGCTG GTGACAACTG TGGTGTTTTG TTAAGAGGTG TTAAAAGAGA 800 

4 0 TGAAATCAAG AGAGGTATGG TTTTGGCCAA ACCAGGTACT GCCACTTCTC 850 

ACAAGAAGTT CTTGGCTTCC TTGTATATTT TGACTTCCGA AGAAGGTGGY 900 

CGTTCCACTC CATTTGGTGA AGGTTACAAG CCTCAATGCT TCTTCAGAAC 950 

TAACGATGTC ACTACCACAT TTTCATTCCC AGAAGGAGAA GGTGTTGACC 1000 

ATTCTCAAAT GATCATGCCA GGTGACAACA TTGAAATGGT TGGTGAAT 1048 

45 



2) INFORMATION FOR SEQ ID NO: 1562 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1562 

TGATCACGGT AAAACCACAT TAACTGCTGC CATTACCAAA GTATTAGCTG 50 

AACAAGGTGG TGCCAACTTT TTGGATTACG GTTCCATTGA TAGAGCTCCA 100 

GAAGAAAGAG CCAGAGGTAT CACTATTTCC ACTGCCCACG TTGAATACGA 150 

AACCAAGAAC AGACACTATG CCCACGTTGA TTGTCCAGGA CACGCTGATT 200 

ATATCAAAAA CATGATTACT GGTGCTGCTC AAATGGATGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAAATGCCA CAAACCAGAG AACATTTATT 300 

GTTGGCAAGA CAAGTTGGTG TTCAAGACTT GGTTGTCTTT GTCAACAAAG 350 

TTGATACTAT TGATGACCCT GAGATGTTGG AATTAGTCGA AATGGAAATG 400 

AGAGAATTGT TGTCCACCTA CGGTTTTGAT GGTGACAACA CTCCTGTTAT 450 

TATGGGATCT GCTTTAATGG CCTTGGAAGG CAAAAAACCA GAAATTGGTA 500 

AGGAAGCTAT TTTGAGATTG TTAGATGCTG TCGATGAACA CATTCCAACT 550 

CCATCAAGAG ACTTGGAACA ACCATTTTTG TTGCCAGTTG AAGACGTGTT 600 

CTCCATCTCT GGTAGAGGAA CTGTTGTCAC CGGTAGAGTT GAAAGAGGTG 650 

TCTTGAAGAA GGGTGAAGAA ATCGAAATTG TTGGTGGTTT TGACAAACCA 700 

TACAAGACCA CTGTTACTGG TATTGAAATG TTCAAAAAGG AATTAGATTC 750 

TGCTATAGCT GGTGACAACT GTGGTGTTTT GTTGAGAGGT GTTAAAAGAG 800 

ATGAAATCAA GAGAGGTATG GTTTTGGCCA AGCCAGGTAC TGCTACTTCT 850 

CACAAGAAAT TTTTAGCATC TTTGTATATT TTGACTTCAG AAGAAGGTGG 900 

TCGTTCCACT CCATTTGGAG AAGGTTACAA GCCTCAATGT TTCTTCAGAA 950 

CTAATGACGT CACTACCACA TTTTCATTCC CAGAAGGAGA AGGTGTTGAC 1000 

CACTCCCAAA TGGTCATGCC AGGTGATAAC ATTGAAATGG TTGGTGAATT 1050 

GATCAAATCA TGTCCATTGG AAGT 1074 



2) INFORMATION FOR SEQ ID NO: 1563 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1563 

GATCACGGGA AGACTACTTT GACCGCTGCC ATCACCAAAG TTTTAGCCGA 50 
AAAAGGTGGT GCTAACTTCT TGGACTACGG TTCTATCGAT AAAGCTCCAG 100 
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AAGAAAGAGC CAGAGGTATT ACTATTTCTG CTGCCCATGT TGAATACGAA 150 

ACTGACAAGA GACACTATGC CCATGTTGAT TGTCCAGGTC ACGCAGATTA 2 00 

TATCAAGAAT ATGATTACTG GTGCTGCTCA AATGGATGGT GCCATTATTG 250 

TTGTTGCTGC TTCCGATGGT CAAATGCCTC AAACCAGAGA ACATTTGTTA 300 

5 TTGGCCAGAC AAGTTGGTGT TCAACACTTG GTTGTTTTCG TCAACAAGGT 350 

CGACACCATT GACGATCCAG AAATGTTGGA ATTGGTTGAA ATGGAAATGA 4 00 

GAGATTTGTT AACTACTTAC GGTTTTGATG GTGATAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAATCC AGAGAACCAG AAATTGGTCA 500 

AAAAGCCATT GAAAAATTGT TAGATGCCGT CGATGAATAC ATTCCAACCC 550 

10 CAGTCAGAGA CTTGGAACAA CCATTCTTGA TGCCAGTTGA AGAAGTTTTC 600 

TCCATTTCCG GTAGAGGTAC CGTTGTTGCT GGTAGAGTCG AAAGAGGTAC 650 

CTTGAAGAAG GGTGAAGAAA TCGAAATTGT TGGTGGTTTC GACAAGCCAT 700 

TCAAGGCCAC TGTTACTGGT ATTGAAATGT TCAAGAAGGA ATTGGACTCC 750 

GCTCTTGCTG GTGACAACTG TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

15 CGAAGTTAAG AGAGGTATGG TCTTGACCAA GCCAAACACC GTCACTTCCC 850 

ACAAGAAGAT CTTGGCCTCG TTGTATATCT TGACCAAGGA AGAAGGTGGT 900 

AGACACTCTC CATTTGGAGC CAACTACAAG CCCCAATTGT TCATGAGAAC 950 

CACCGATGTT ACCGGTACCA TGACCTTCCC AGAAGGTGCC GACCAATCTG 1000 

CCATGGTCAT GCCAGGTGAC AACGTTGAAA TGC 1033 

20 



2) INFORMATION FOR SEQ ID NO: 1564 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 
35 (B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1564 

GATCACGGTA AGACTACATT GACAGCTGCT ATCACCAAGA CATTGGCCAA 50 

4 0 GAACGGTGGT GCTGATTTCT TGGACTACTC TTCCATTGAC AAAGCTCCAG 100 

AGGAGAGAGC CCGTGGTATC ACTATCTCTA CTGCCCATGT CGAGTACGAG 150 

ACCGCCAAGA GACATTACTC CCACGTCGAC TGTCCAGGTC ACGCCGACTA 200 

CATCAAGAAC ATGATTACTG GTGCTGCCCA AATGGACGGT GCTATCATCG 250 

TTGTCGCCGC CACCGATGGT CAAATGCCAC AAACTAGAGA GCATTTGCTG 3 00 

45 TTGGCCAGAC AAGTCGGTGT TCAACGTATC GTTGTCTTTG TCAACAAGGT 350 

GGACACCATC GATGACCCTG AAATGTTGGA ATTAGTGGAA ATGGAAATGA 4 00 

GAGAATTGTT GAACGAATAC GGTTTTGACG GTGACAATGC CCCTATCATT 4 50 

ATGGGTTCCG CTTTGTGTGC CCTAGAAGGT CGTCAACCTG AAATTGGTGA 500 

GCAAGCTATC ATGAAACTAT TGGACGCTGT TGATGAATAC ATTCCAACCC 550 

50 CAGAAAGAGA CTTGAACAAG CCATTCTTGA TGCCTGTTGA AGACATCTTC 600 

TCCATCTCTG GTAGAGGTAC CGTCGTCACT GGACGTGTCG AAAGAGGTAA 650 

CTTGAAGAAG GGTGAAGAAG TTGAAATTGT TGGTCACAAC ACTACCCCAT 700 

TGAAGACCAC CGTTACTGGT ATCGAAATGT TCAGAAAGGA ATTGGACCAA 750 
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GCTATGGCTG GTGACAACGC 
CCAATTGAAG AGAGGTATGG 
ACACCAAGAT TTTGGCTTCT 
AGACATTCTG GTTTCGGTGA 
5 CGCAGATGTC ACTGTTGTGA 
CTATGCAAGT TATGCCAGGT 
CACCCA 



CGGTATCCTA TTGAGAGGTA 
TCATGGCCAA GCCAGGTACC 
TTGTACATCT TGTCTAAGGA 
AAACTACAGA CCTCAGATGT 
TGAAGTTCCC AGAATCTGTG 
GACAACGTCG AAATGGTCTG 



TCAGAAGAGA 800 

GTCAAGGCTC 850 

AGAAGGTGGT 900 

TTATCAGAAC 950 

GAAGACCACT 1000 

TGAACTAGTC 1050 
1056 



10 



20 



50 



2) INFORMATION FOR SEQ ID NO: 1565 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1061 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1565 

GATCATGGTA AGACCACTTT GACCGCTGCT ATCACCAAGG TTTTGTCGGA 50 

AAAAGGAGGT GCTAATTTCT TGGATTACGG CTCCATCGAC AGAGCTCCAG 100 

AAGAGAGAGC CAGAGGTATC ACCATTTCCA CTGCCCATGT TGAGTACCAA 150 

30 ACTGATAAGA GACATTATGC CCACGTTGAC TGTCCAGGTC ACGCCGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCCATTATTG 250 

TTGTTGCTGC CACTGACGGT CAAATGCCTC AGACCAGAGA GCACTTGTTG 300 

TTGGCCAGAC AAGTTGGTGT GCAACACTTG GTAGTTTTTG TGAACAAGGT 350 

GGACACCATT GACGATCCCG AGATGTTGGA ATTGGTCGAG ATGGAAATGA 400 

35 GAGAATTGTT GAGTCAGTAC GGTTTCGATG GTGACAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAAAGT AAGCAGCCAG AAATTGGTGT 500 

GCAAGCCATT GAAAAATTGT TGGACGCTGT CGATGAGCAC ATTCCTACTC 550 

CTACCCGTGA CTTGGAACAG CCATTCTTGT TGCCTGTTGA AGATGTGTTC 600 

TCCATTTCTG GTAGAGGAAC TGTGGTTACT GGTAGAGTCG AAAGAGGTTC 650 

4 0 GTTGAAGAAG GGTGAGGAAA TCGAGATTGT TGGTGACTTT GACAAGCCAT 700 

TCAAGACCAC TGTGACTGGA ATTGAAATGT TCAAGAAGGA ATTGGATGCT 750 

GCTATGGCTG GTGACAATGC TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

CGATGTCAAG AGAGGTATGG TTTTGGCCAA GCCTTCCACC GTCACTTCTC 850 

ACAAGAAGGT GTTGGCTTCC TTGTACATCT TGAGTAAGGA AGAAGGTGGC 900 

45 CGTCACTCTC CTTTTGGTGA GAACTACAAG CCTCAATTGT TCATCAGAAC 950 

TACTGACGTT ACCGGTACTT TAAGATTCCC AGCCGGCGAG GGTGTCGACC 1000 

ACTCGCAAAT GGTTATGCCA GGTGACAATG TTGAGATGGA AATTGAGCTT 1050 

GTGAGAAAGA C 1061 



2) INFORMATION FOR SEQ ID NO: 1566 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1566 

15 GATCACGGTA AGACTACCTT GACTGCTGCT ATCACCAAGG TTTTGGCTTC 50 

TAAGGGTGGT GCTAGCTTCT TGGACTATGG TTCCATTGAC AGAGCCCCAG 100 

AGGAGAGAGC TAGAGGTATT ACTATTTCTA CTGCCCACGT TGAGTACCAA 150 

ACCGAAAAGA GACACTACGC CCACGTCGAC TGTCCAGGTC ACGCTGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCTATCATTG 250 

2 0 TTGTTGCTGC TTCTGATGGT CAGATGCCTC AGACCAGAGA GCACCTTTTG 300 

TTGGCCAGAC AGGTTGGTGT TCAGAACTTG GTTGTTTTCG TTAACAAGGT 350 

TGACACCATT GACGACCCTG AAATGTTGGA ATTGGTTGAG ATGGAAATGA 4 00 

GAGAATTGTT GACTACTTAC GGTTTTGACG GTGATGAGAC TCCTGTTATC 4 50 

ATGGGTTCTG CTTTGTGCGC TTTGGAAGAG AAGCAACCAG AGATTGGTGA 500 

25 GCAGGCTATC ATGAAGTTGT TGGACGCTGT CGATGAGTAC ATTCCAACCC 550 

CACAGCGTGA CTTGGAGCAG CCATTCTTGA TGCCTGTTGA GGATGTTTTC 600 

TCCATTTCTG GTAGAGGTAC TGTCGTTACT GGTAGAGTTG AGAGAGGTTC 650 

TTTGAAGAAG GGTGAGG AAA TCGAGATTGT CGGTGACTTC GCCAAGACTT 700 

TCAAGGCTAC CGTTACTGGT ATTGAGATGT TCAAGAAGGA ATTGGATGCT 750 

30 GCTATGGCTG GTGACAACGC CGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

TGAGATCTCC CGTGGTGATG TCTTGGCCAA GCCAGGTACT GTTACTCCAC 850 

ACAAGAAGAT CTTGGCTTCT TTGTACGTTT TGACCAAGGA AGAAGGTGGT 900 

CGTCACAACC CATTCGCTGA GAACTACAAG CCACAGTTGT TCCTCAGAAC 950 

CACCAACGTC ACTGGTACCA TGAGATTCCC AGAAGGTGAA GATGTTGACC 1000 

35 ACTCTGCCAT GGTTAACCCA GGTGACAACG TTGAGATGGA AATCGAGTTG 1050 

GGTAGAAAGG CCCCACTTGA GTT 1073 



40 2) INFORMATION FOR SEQ ID NO: 1567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases ' 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1567 



CATGGTAAGA CCACTTTGAC TGCTGCCATC ACCAAGACTC TAGCTGAACG 50 

TGGTGGTGCT GACTTTTTGG ACTACTCTTC TATTGACAAG GCTCCAGAAG 100 

5 AAAGAGCYAG AGGTATCACT ATTTCTACTG CTCATGTTGA ATACGAGACT 150 

GAAAAGAGAC ATTACTCCCA CGTTGACTGT CCAGGTCACG CTGATTACAT 200 

CAAGAACATG ATTACTGGTG CTGCTCAAAT GGACGGTGCT ATTATTGTTG 250 

TTGCTGCTAC TGATGGTCAA ATGCCTCAAA CCAGAGAGCA TTTGTTGTTG 300 

GCCAGACAAG TTGGTGTCCA ACACATTGTT GTTTTCGTTA ACAAGGTTGA 350 

10 CACCATCGAT GATCCAGAAA TGTTGGAATT GGTTGAAATG GAAATGAGAG 4 00 

AATTRTTGAC TCAATATGGC TTTGACGGTG ACAACACTCC AGTGATCATG 450 

GGTTCTGCTT TGTGTGCCTT GGAAGGTAAG CAACCAGAAA TTGGTGAGCA 500 

AGCCATCATG AAGTTGTTGG ACGCTGTTGA CGAATACATC CCAACCCCAG 550 

CCCGTGACTT GGAAAAVCCA TTCTTGATGC CTGTTGAAGA TATCTTCTCC 600 

15 ATTTCCGGTA GAGGTACTGT CGTCACTGGT AGAGTTGAAC GTGGTAACTT 650 

GAAGAAGGGT GAAGAAATCG AAATTGTTGG TCACAACACC ACTCCTTTCA 700 

AGACTACTGT TACTGGTATT GAAATGTTCA GAAAGGAATT GGACCAAGCC 750 

ATGGCTGGTG ACAACGCTGG TGTCCTTTTG AGAGGTGTCA GAAGAGACCA 800 

ATTGAAGAGA GGTATGGTTT TGGCTAAGCC AGGTACTGTT AAGGCCCACA 850 

20 CCAAGTTCTT GGCTTCCTTG TACATTTTGA CCAAGGAAGA AGGTGGTAGA 900 

CACTCCGGTT TCGGTGAAAA CTACAGACCA CAAATCTACG TCAGAACTGC 950 

TGACGTTACC GTDGTCTTGA AGTTCCCAGA ATCTGTTGAA GACCATTCCA 1000 

TGCAAGTCAT GCCAGGTGAC AATGTCGAAA TGGAGTGTGA ATTGGTTCAC 1050 

CCAACTCCAT TG 1062 

25 



2) INFORMATION FOR SEQ ID NO: 1568 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 
40 (B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1568 

CGGAAAGACC ACCTTGACCG CCGCCATTAC CAAGGTTTTG GCTGACAAGG 50 

45 GTGGCGCCAA CTTCTTGGAC TACGGTGCCA TTGACAAGGC TCCTGAAGAA 100 

AGAGCACGTG GTATCACCAT TTCCACTGCC CACGTTGAAT ACGAGACCGA 150 

CAACAGACAC TACGCCCACG TTGACTGTCC AGGTCACGCC GATTACATCA 200 

AGAACATGAT CACGGGTGCC GCTCAAATGG ACGGTGCCAT TATTGTTGTT 250 

GCAGCCACCG ACGGCCAAAT GCCTCAAACC AGAGAGCACT TGTTGTTGGC 300 

50 CAGACAAGTT GGTGTGCAAC ACTTGGTTGT TTTCGTGAAC AAGGTTGATA 350 

CCATCGACGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGAGAA 4 00 

TTGTTGACTC AATACGGATT TGACGGCGAT GAAACCCCTG TTGTTATGGG 4 50 

CTCTGCTCTT TGCGCTTTGG AAGGTAGAGA ACCAGAGATT GGTGAGCAAG 500 

810 
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CCATCACCAA GTTGTTGGAG GCTGTTGACG 
CGTGACTTGG AACAACCATT CTTGATGCCT 
TTCTGGTAGA GGTACTGTTG TCACTGGTAG 
AGAAGGGTGA GGAGATCGAG ATTGTTGGTG 
5 ACTACTGTTA CTGGTATTGA GATGTTCAAG 
GGCTGGTGAC AATGCTGGTA TCTTGTTGAG 
TTTCCCGTGG TATGGTTTTG GCCAAGCCAG 
AAGGTTTTGG CTTCTTTGTA CATTTTGTCT 
CTCTCCATTT GGCGAGAACT ACAAGCCTCA 
10 ATGTCACTGG TACTTTGAGA TTCCCAGCAG 
GCTATGGTTT CTCCAGGTGA CAATGTCGAG 
AAAGACTCCT CT 
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AGTACATCCC AACCCCACAA 550 

GTTGAAGATG TTTTCTCCAT 600 

AGTGGAGAGA GGTTCCTTGA 650 

ACTTTGACAA GCCTTTCAAG 700 

AAGGAATTGG ACGCTGCTAT 750 

AGGTGTCAAG AGAGAACAAG 800 

GCACTGTGAC CTCGCACAAG 850 

AAGGAAGAAG GTGGTCGTCA 900 

ATTGTTCCTT AGAACTACCG 950 

GTGAGGACGT TGACCACTCC 1000 

ATGGAAATCG AGTTGGTCAG 1050 

1062 



15 



2) INFORMATION FOR SEQ ID NO: 1569 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 990 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1569 

TCACCAAGAC TTTGGCTGAA CGTGGTGGTG CTGATTTCTT GGACTACTCT 50 

35 TCCATTGACA AGGCTCCAGA AGAAAGAGCR AGAGGTATCA CTATTTCTAC 100 

TGCACATGTT GAATATGAAA CTGACAAGAG ACATTACTCT CACGTCGACT 150 

GTCCAGGTCA TGCTGATTAC ATCAAGAATA TGATTACTGG TGCTGCCCAA 200 

ATGGATGGTG CTATCATTGT TGTTGCTGCT ACAGATGGTC AAATGCCTCA 250 

AACCAGAGAA CATTTGTTGT TGGCTAGACA AGTTGGTGTT CAACAYATCG 300 

4 0 TTGTTTTCGT TAACAAGGTT GACACTATCG ATGACCCTGA AATGTTAGAA 350 

TTGGTTGAAA TGGAAATGAG AGAATTATTG ACCCAATACG GTTTCGATGG 4 00 

TGACAACACT CCAGTCATCA TGGGTTCTGC TTTGTGTGCT TTAGAAGGTA 450 

AGCAACCAGA AATTGGTGAA CAAGCAATCA TGAAGTTATT GGACGCTGTT 500 

GACGAATACA TCCCAACTCC AGCTCGTGAT TTGGAAAAGC CTTTCTTGAT 550 

4 5 GCCTGTTGAA GATATCTTCT CCATCTCCGG TAGAGGTACC GTCGTAACTG 600 

GTAGAGTTGA ACGTGGTAAC TTGAAKAAGG GTRAAGAAAT CGAAATCGTT 650 

GGTCACAACA CCACTCCATT CAAGACCACT GTTACTGGTA TTGAAATGTT 700 

CAGAAAGGAA TTGGACCAAG CTATGGCTGG TGATAACGCT GGTGTCWTGT 750 

TGAGAGGTGT CAGAAGAGAC CAATTAAAGA GAGGTATGGT CTTGGCCAAG 800 

50 CCAGGTACTG TCAAGGCTCA CACCRAATTC TTGGCCTCTT TGTATATCTT 850 

GACCAAGGAA GAAGGTGGTA GACATTCCGG TTTCGGTGAA AATTACAGAC 900 

CTCAAATCTA CGTTAGAACT GCTGATGTCA CCGTTGTTTT GAAGTTCCCA 950 

GAAGCTGTTG AAGATCACTC TATGCAAGTC ATGCCAGGTG 990 

811 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1570 



(i) 



SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1184 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



5 



(ii) 



MOLECULE TYPE: Genomic DNA 



10 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 750 



15 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1570 



GATCATGGTA AAACCACTTT GACTGCTGCC ATTACTAAAG TCTTGGCTGA 
TAAAGGTCAA GCTAACTTCT TAGATTACGG TTCTATTGAT AGAGCTCCAG 
AAGAAAGAGC CAGAGGTATC ACCATCTCTA CTGCCCACGT TGAATATGAA 

20 ACCGAAAAAA GACATTACGC CCATGTTGAG TATGTATACT TTTTTTTGTT 
GTGTAATTGT TTTAAAGATT TTCTTTAAAG CTGAAGAAGT CAAATCAGTT 
CTTTGATAAC TTCTATTAAA AAAAGGGAAA AATTAACAAG ATATACTAAC 
ACTATAACAG TTGTCCTGGA CATCAAGATT ATATCAAGAA TATGATTACC 
GGTGCCGCTC AAATGGATGG TGCTATTATT GTTGTTGCTG CCACTGATGG 

25 TCAAATGCCA CAAACCAGAG AACATTTGTT GTTGGCTAGA CAAGTCGGTG 
TTCAAGATTT GGTTGTCTTT GTTAACAAAG TCGACACTAT TGATGACCCA 
GAAATGTTGG AATTGGTTGA AATGGAAATG AGAGAATTAT TGACTACTTA 
CGGTTTTGAT GGTGATAACA CTCCTGTTAT CATGGGTTCT GCTTTGATGG 
CCTTGCAAGG TAAACAACCA GAAATTGGTG AACAAGCTAT CATGAAATTG 

30 ATGGACGCTA TTGATGAACA CATTCCAACC CCAACCAGAG ACTTGGAACA 
ATCTTTCTTG ATGCCAGTTG AAGATGTTTT CTCCATTTCT GGTAGAGGTA 
CTGTTGTTAC TGGTAGAGTC GAAAGAGGTG TCTTAAAGAA GGGTGAAGAA 
ATTGAAATTG TTGGTGGTTT CGAAAAACCA TTCAAGACCA CTGTTACTGG 
TATTGAAATG TTCAAGAAAG AATTAGATGC TGCTATGGCT GGTGACAACT 

35 GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG ACGAAATCAA GAGAGGTATG 
GTTTTGGCTA AACCAGGTAC TGCTACTTCC CACAAGAAAT TCTTGGCTTC 
CATGTATATC TTAACTGCTG AAGAAGGTGG TCGTTCCACT CCATTCGGTG 
AAGGTTACAA GCCACAATGT TTCTTCAGAA CTAACGATGT TACCACTTCC 
TTCTCTTTCC CAGAAGGTGA AGGTGTTGAC CACTCCCAAA TGGTTATGCC 

40 AGGTGACAAC ATTGAAATGG TCGGTGAATT GATT 



2) INFORMATION FOR SEQ ID NO: 1571 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 

iioo 

1150 
1184 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: ' 

(A) ORGANISM: Candida vi< 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ 

CGATCACGGT AAGACCACCT TGACCGCCGC 
ACAAGGGTCA GGCTAACTTC TTGGACTACG 
GAAGAAAGAG CAAGAGGTAT CACTATCTCC 
GACTGATAAG AGACACTATG CCCACGTTGA 
ATATCAAGAA TATGATCACT GGTGCTGCCC 
GTTGTTGCTG CTACTGACGG TCAGATGCCA 
GTTGGCTAGA CAAGTTGGTG TCCAAGACTT 
TTGACACTAT CGATGACCCA GAAATGTTGG 
AGAGAATTAT TATCTTCTTA YGGCTTTGAC 
CATGGGTTCC GCTTTGATGG CTTTGCAAGG 
AACAAGCTAT TATTAAGTTG ATGGACGCTA 
CCAACCAGAG ACTTGGAACA ACCATTCTTG 
TTCTATTTCC GGTAGAGGTA CCGTCGTCAC 
TCTTGAAGAA GGGTGAAGAA ATTGAAATTG 
TTCAAGACCA CCGTTACTGG TATTGAAATG 
TGCTATGGCT GGTGACAACT GTGGTGTCTT 
ACGAAATCAG CAGAGGTATG GTTTTGGCCA 
CACAAGAAGT TCTTGGCCTC CATGTACATC 
TCGTCGTACC CCATTCGGTG AAGGTTACAA 
CCAATGACAT CACCACCACT TTCACTTTCC 
CACTCCCAAA TGGTTATGCC AGGTGACAAC 
GTACAAGGCT TGTCCTTGGA A 



nathii 



ID NO: 1571 



CATCACCAAG 


GTCTTGGCCG 


50 


GATCCATTGA 


CAGAGCCCCT 


100 


ACTGCCCACG 


TTGAATACGA 


150 


TTGCCCGGGC 


CATCAAGATT 


200 


AAATGGACGG 


TGCTATCATT 


250 


CAAACCAGAG 


AACACTTGTT 


300 


GGTTGTTTTC 


GTTAACAAGG 


350 


AATTGGTTGA 


AATGGAAATG 


400 


GGTGACAACA 


CCCCAGTTGT 


450 


TAAGCAACCA 


GAAATTGGTG 


500 


TTGATGAACA 


CATYCCAACC 


550 


TTGCCAGTTG 


AAGATGTCTT 


600 


TGGTAGAGTC 


GAAAGAGGTG 


650 


TCGGTAACTT 


TGAAAAGCCA 


700 


TTCAAGAAGG 


AATTGGATGC 


750 


GTTGAGAGGT 


GTCAAGAGAG 


800 


AGCCAGGTAC 


CGTCACTTCC 


850 


TTGACTGGTG 


AAGAAGGTGG 


900 


GCCACAATGT 


TTCTTCAGAA 


950 


CAGAAGGTGA 


AGGTGTCGAC 


1000 


ATCGAAATGG 


TTGGTGAATT 


1050 
1071 



2) INFORMATION FOR SEQ ID NO: 1572 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1572 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 50 

ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 150 

GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 200 

CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 250 

CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 300 

813 
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TCCTACGCCT GAGCGTGCCG TTGACGGTAC 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA 
5 ACCAGGGCGA AGCTGGCGAT AACGTCGGTC 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG 
GCCACACACT GACTTCGACG CCGAGGTGTA 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT 
CGTACAACTG ACGTGACCGG CACCATCGAG 
10 GGTTCTGCCA GGCGACAACA TTTCGATGAA 
TCGCCATGGA AGAAGGT 
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GTTCCTGATG CCTGTTGAAG 3 50 

TTGTGACCGG TCGTATTGAG 4 00 

GAAATCGTGG GTATCAAAGA 4 50 

AATGTTCCGC AAACTGCTGG 500 

TGCTGCTGCG TGGTACCAAG 550 

GCCAAGCCAG GCTCGATCAA 600 

CATTCTGTCC AAAGAAGAAG 650 

ACCGTCCTCA GTTCTACTTC 700 

CTGCCAGAAG ACAAGGAAAT 750 

AGTGTCCCTG ATCGCTCCTA 800 

817 



15 2) INFORMATION FOR SEQ ID NO: 1573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1573 



TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 200 

35 TCGTTCGTGG -TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 300 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 4 00 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 

40 GTCTGTTGTA ACAGGCGTTG AAATGTTCCG TAAGATTCTT TCTGAAGGTG 500 

AAGCAGGTGA TAACGTAGGA CTTTTGCTCC GCGGTATCGA TAAGGATGAA 550 

GTAAAGCGTG GTATGGTTGT TGTACACCCA GGTGCCATCA CTCCTCACGA 600 

TCACTTCAAG GCTTCCATCT ATGTATTGAA GAAGGAAGAG GGTGGACGTC 650 

ATACTCCATT CGGAAACAAG TATCGTCCTC AGTTCTATCT CCGTACAATG 700 

45 GACTGTACAG GTGAGATCAC TTTGCCAGAA GGCGTAGAGA TGGTGATGCC 750 

TGGTGACAAC GTAGAGATTG AGGTTACCTT GATTTACAAG GTTGCC 796 



50 2) INFORMATION FOR SEQ ID NO: 1574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases 

814 
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(B) TYPE: Nucleic" acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Svccinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1574 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 350 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 
ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC * 800 

TATGGCTGAA GGCGAGAGAT 820 



2) INFORMATION FOR SEQ ID NO: 1575 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1575 

GGTAGTGTCT GCAGCTGACG GCCCAATGCC ACAAACTCGT GAACATATCC 50 

TATTGTCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGACGATGA AGAATTACTT GAATTAGTTG AAATGGAAGT 150 

ACGTGATCTA TTAACTGAAT ACGACTTCCC AGGTGATGAT ACTCCTGTTA 200 

TTTCTGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA ATATGAACAA 250 

815 
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AAAGTCTTAG 
ACGTGACCAT 
TCACTGGTCG 
AAAGTCGGTG 
GACAACTGTT 
AAGCTGGCGA 
ATCGAACGTG 
AAAATTCTCA 
ATACTCCATT 
GACATCACTG 
AGGTGATAAC 
AAA 



ACTTAATGGC 
GACAAACCGT 
TGGAACTGTT 
ATGAAGTTGA 
ACGGGTGTAG 
TAACATCGGT 
GTCAAGTGTT 
GCTGAAGTTT 
CTTCTCAAAC 
GTGTCATTGA 
GTAGCAATGG 



AGCTGTAGAT GATTTCATCC CAACTCCTGA 
TCATGATGCC GATTGAAGAT GTTTTCTCAA 
GCTACAGGTC GTGTTGAACG TGGAACTATT 
TATCATCGGT ATTCATGAAA ATGTTAAAAA 
AAATGTTCCG TAAATTGTTG GATTACGCTG 
ACATTATTGC GTGGTGTTTC TCGTGATGAT 
GGCTAAACCA GGCACAATCA CACCACATAC 
ATGTATTAAC AAAAGAAGAA GGCGGACGTC 
TATCGCCCAC AATTTTACTT CCGTACAACT 
ATTGCCAGAA GGTACTGAAA TGGTTATGCC 
AAGTTGAATT AATTCACCCT GTTGCTATTG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
803 



2) INFORMATION FOR SEQ ID NO: 1576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1576 

TTAATGAAGC AATTGTTGTA AATTTTGAAA GCGAAGGAAA AAAACATAAA 50 

CTTGTTTTAG AAGTAGCAGC TCATTTAGGA GATAATAGAG TTAGAACTAT 100 

TGCTATGGAT ATGACAGATG GTTTGGTAAG GGGCTTAAAA GCTGAGGCTT 150 

TAGGTGCTCC TATTAGTGTT CCTGTTGGTG AGAAAGTTTT AGGAAGAATT 200 

TTCAATGTTA CTGGAGATTT GATCGATGAA GGTGAAGAAA TTTCTTTTGA 250 

TAAAAAATGG GCAATTCATA GAGATCCGCC AGCTTTTGAA GATCAAAGCA 300 

CAAAAAGTGA GATTTTTGAA ACAGGGATTA AAGTTGTAGA TTTGCTTGCT 350 

CCTTATGCAA AAGGTGGTAA AGTAGGTCTT TTTGGTGGTG CAGGTGTTGG 400 

TAAAACTGTT ATTATTATGG AGCTTATTCA CAATGTTGCA TTTAAGCATA 450 

GCGGCTATTC TGTATTTGCA GGTGTGGGTG AGAGAACTCG TGAAGGAAAT 500 

GACCTTTATA ATGAAATGAA AGAAAGTAAT GTTTTAGACA AAGTTGCTCT 550 

ATGTTATGGA CAAATGAATG AACCACCAGG AGCAAGAAAT CGTATTGCTT 600 

TAACAGGTTT AACAATGGCT GAGTATTTTA GAGATGAAAT GGGTCTTGAT 650 

GTGCTTATGT TTATTGATAA TATCTTTAGA TTTTCACAAT CAGGTTCTGA 700 

AATGTCAGCA CTTTTAGGAA GAATTCCATC AGCTGTGGGT TATCAACCAA 750 

CCCTAGCAAG TGAAATGGGT AAATTCCAAG AAAGAATTAC TTCAACTAAA 800 

AAAGG 805 



2) INFORMATION FOR SEQ ID NO: 1577 



816 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Campylobacter rectus 

(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION:' SEQ ID NO: 1577 

15 ATTTACCCAA GATCAACGAA GCCGTCGAAG TAAATTTCGA GGTTGAGGGC 50 

AAGCAAAACA GACTTGTGTT AGAGGTTGCC GCACACCTTG GCGATAACCG 100 

CGTAAGAACG ATCGCTATGG ATATGAGCGA AGGCTTAACC AGAGGCCTTG 150 

AGGCTACGGC TCTTGGCGCG CCTATTAGCG TTCCGGTCGG CGAAAAAGTT 200 

TTGGGAAGGA TTTTTAACGT CGTCGGCGAT CTGATCGACG AGGGTGAAGG 250 

20 CATAGAATTT GATAAAAAAT GGTCTATCCA CCGCGATCCT CCGCCGTTTG 300 

AAGAGCAAAG CACGAAGAGT GAAATTTTTG AAACGGGTAT AAAAGTGGTC 350 

GATCTTCTAG CCCCTTATGC AAAAGGCGGT AAGGTCGGAT TATTCGGCGG 400 

TGCCGGCGTC GGTAAGACGG TTATTATCAT GGAGCTTATC CACAACGTTG 450 

CATTTAAGCA TAGCGGTTAT TCCGTGTTTG CCGGCGTGGG CGAGCGAACC 500 

25 CGCGAAGGAA ACGACCTTTA TCACGAGATG AAAGAGAGTA ACGTTTTGGA 550 

CAAAGTCGCC TTGTGCTACG GCCAGATGAA CGAGCCGCCG GGAGCAAGAA 600 

ACCGCATCGC TCTAACAGGC CTAACGATGG CTGAATACTT CCGCGACGAG 650 

ATGGGACTTG ACGTTTTGAT GTTTATAGAC AACATCTTCC GTTTCTCTCA 700 

GTCTGGCGCT GAGATGTCGG CGCTTCTTGG ACGTATCCCG TCAGCCGTTG 750 

30 GTTATCAGCC GACTTTGGCG AGCGAAATGG GCAAATTCCA AGAGAGAATT 800 

ACATCAACC 809 



35 2) INFORMATION FOR SEQ ID NO: 1578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1578 

AATCGAAGTG CAACGTTCAT TGCGTGTACT TGATGGTGCT GTAACTGTGT 50 
TGGACTCACA ATCAGGTGTA GAACCTCAAA CAGAAACAGT TTGGCGTCAA 100 
GCGACAGATT ACCGCGTACC GCGTGTTGTA TTCTGTAACA AAATGGACAA 150 

817 
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AATTGGTGCA GACTTCTTAT ACTCTGTATC 
AAGCAAATGC TCACCCAATC CAATTACCAA 
ACTGGTATTA TCGACTTAGT AAAAATGAAA 
CTTAGGAACT GAAATCCAAG AGACTGAAAT 
TAGCTGAAGA ATGGCGCGAA AAATTAATTG 
GAAGAACTAA TGATGAAATT CTTGGAAGGT 
ATTGAAAGCT GGTATTCGTC AAGCAACATT 
TTCTTTGCGG ATCTGCCTTT AAAAACAAAG 
GCAGTCATCG ACTACTTGCC TTCACCACTT 
GATCAATCCT AAAACAGACG AAGAAACTGA 
CACCATTTGC TTCATTAGCA TTTAAAGTAA 
CGTTTGACAT TCTTYCGTGT GTATTCARGT 
CGTATTGAAT GCTTCAAAAG GCAAACGCGA, 
AAATGCACGC CAACACTCGT GCTGAAATCC 
ATCGCCGCTG CTGTTGGTTT GAAAGACACA 
TGATGAAAAA TCCCCAGTAA TCCTTGAATC 
TTATCGAAGT CGCTGTTGAG CTTAAATCAA 
GGGGTTGCTT TACAAAAACT TGCTGAAGAA 
AMCAAACGCT GAAACAGGCG AAACTGTTAT 
ACTTGGACGT CTTAGTTGAC CGTATGCGTC 
AACGTAGGTG CGCCTCAAGT TTCTTATCGT 
ACAAGCGGAA GGTAAATTTG TACGTCAGTC 
GTCACGTATG GGTCGAATTT ACACCAAACG 
TTCGAAAACG CGATTGTCGG TGGTGTGGTT 
AGTTGAAAAA GGACTTGAAG AATCAATGGC 
ACCCATTAGT AGACATCAAA GCAAAACTTT 
GTCGATTCAA GTGAAACTGC CTTCCGTGTT 
TGCTGCAGCG AAGAAAGCAA ACCCAGTAAT 
TAGTTATCAC TGTACCAGAA GATTACTTAG 
ACTGCTCGTC GTGGACGCGT AGAAGGAATG 
AATCGTGAAC GCAATCGTGC C 
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AACTTTACAT GATCGTTTAC 200 

TTGGTGCGGA AGATGACTTT 250 

GCTGAAATCT ACACAAATGA 300 

TCCTGAAGAA TACGTAGAAT 350 

AAGCTGTTGC TGATACTGAT 400 

GAAGAAATCA CTGAAGAAGA 450 

GACTGTTGAC TTTTTCCCTG 500 

GGGTTCAATT GATGTTGGAT 550 

GATGTTCCTG CGATTAAAGG 600 

TCGTCCGGCT GACGATGAAG 650 

TGACTGACCC ATTCGTARGT 700 

ATCTTGAACT CTGGATCATA 750 

ACGTATCGGT CGGATCCTAC 800 

AAACAGTATA CTCAGGCGAT 850 

ACAACTGGGG ATCCACTATG 900 

AATCGAATTC CCAGAACCAG 950 

AAGCTGACCA AGATAAAATG 1000 

GATCCATCAT TCCGTGTGGA 1050 

CGCTGGTATG GGAGAACTTC 1100 

GCGAATTTAA AGTTGAAGCA 1150 

GAAACATTCC GTGCAGCAAC 1200 

TGGTGGTAAA GGTCAATACG 1250 

AAGAAGGTAA AGGCTTCGAA 1300 

CCTCGTGAAT ACATCCCAGC 1350 

GAACGGTGTC TTAGCCGGTT 1400 

ATGATGGTTC ATACCATGAT 1450 

GCAGCTTCTA TGGCTTTACG 1500 

TCTTGAACCA ATGATGAAAG 1550 

GTGATGTTAT GGGTCACGTA 1600 

GAAGCACWCG GTAACTCACA 1650 

1671 



2) INFORMATION FOR SEQ ID NO: 1579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1579 

GAAGTACAAC GTTCACTACG GGTTCTTGAC GGWGCTGTAA CAGTATTGGA 50 
CTCACAATCT GGTGTAGAAC CACAAACTGA AACAGTTTGG CGTCAAGCGA 100 
CAGATTACCG CGTACCACGT ATCGTATTCT GTAACAAAAT GGATAAAATC 150 
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GGTGCAGACT TCTTATACTC TGTATCTACT 
AAATGCTCAT CCAATCCAAT TACCAATTGG 
GTATCATCGA TCTAGTAAAA ATGAAAGCTG 
GGAACAGAGA TTCAAGAAAC TGAAATTCCT 
5 TGAAGAATGG CGCGAAAAAT TAGTTGAAGC 
AGCTAATGAT GAAATTCTTG GAAGGTGAAG 
AAAGCTGGTA TCCGTCAAGC GACATTGACT 
TTGTGGTTCA GCCTTCAAAA ACAAAGGGGT 
TCATCGACTA CCTTCCTTCA CCACTTGATG 

10 AATCCTAAAA CTGACGAAGA AACTGATCGT 
TTTTGCTTCA CTAGCATTTA AAGTAATGAC 
TGACATTCTT CCGTGTGTAT TCAGGTGTCT 
TTGAATGCTT CAAAAGACAA ACGCGAACGT 
GCACGCGAAC ACTTGTGCAG AAATCCAAAC 

15 CTGCAGCTGT TGGTTTGAAA GATTCCACAA 
AAAAATCACC CAGTAATCCT TGAATCAATC 
CGAAGTAGCT GTTGAACYTA AATCAAAAGC 
TGGCTTTACA AAAACTTGCT GAAGAAGATC 
AACGCTGAAA CTGGCGAAAC TGTTATCGCA 

20 GGACGTCCTT GTTGACCGTA TGCGTCGCGA 
TTGGTGCGCC TCAAGTTTCT TACCGCGAAA 
GCCGAAGGTA AATTTGTACG TCAGTCTGGT 
CGTATGGATC GAATTTACAC CAAACGAAGA 
AAAACGCAAT TGTCGGTGGT GTGGTTCCAC 

25 GAAAAAGGAC TTGAAGACTC AATGGCTAAC 
ATTGGTTGAC ATCAAAGCCA AGCTTTACGA 
ATTCAAGTGA AACAGCCTTC CGTGTGGCAG 
GCAGCGAAGA AAGCTAATCC AGTGATTCTT 
TATCACTGTT CCTGAAGATT ACTTAGGTGA 

30 CTCGTCGTGG ACGTGTTGAA GGTATGGAAG 
GTTAACGCGA TT 



PCT/CAOO/01150 

TTACATGATC GCTTGCAAGC 200 

TGCGGAAGAT GACTTTACTG 250 

AGATCTATAC AAACGATTTA 300 

GAAGAGTACA AAGAATTAGC 350 

TGTTGCAGAT ACTGACGAAG 400 

AAATCACTGA AGAAGAATTG 4 50 

GTTGAATTTT TCCCAGTTCT 500 

TCAATTGATG TTGGATGCAG 550 

TTCCTGCAAT CAAAGGGATC 600 

CCTGCTGACG ATGAAGCGCC 650 

TGACCCATTC GTAGGTCGTT 700 

TGAACTCTGG ATCATATGTC 750 

ATCGGTCGTA TTCTGCAAAT 800 

AGTTTATTCA GGCGATATCG 850 

CAGGGGATAC ATTGTGTGCG 900 

GAATTCCCAG AMCCAGTTAT 950 

TGACCAAGAT AAAATGGGTG 1000 

CTTCATTCCG TGTAGAAMCA 1050 

GGGATGGGTG AACTTCACTT 1100 

ATTTAAAGTT GAAGCAAACG 1150 

CATTCCGTGC TTCTACGCAA 1200 

GGTAAAGGTC AATACGGTCA 1250 

AGGTAAAGGC TTCGAATTCG 1300 

GTGAATACAT CCCAGCAGTT 1350 

GGTGTTCTAG CTGGTTATCC 1400 

TGGTTCATAC CATGATGTCG 1450 

CTTCAATGGC TTTACGTGCT 1500 

GAACCAATGA TGAAAGTTGT 1550 

TATTATGGGA CACGTAACTG 1600 

CGCATGGTAA CTCACAAATC 1650 

1662 



35 2) INFORMATION FOR SEQ ID NO: 1580 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1580 

CAATCGAAGT ACAACGTTCT CTTCGTGTAT TGGATGGTGC TGTTACCGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TATGGAGTTC CACGTATCGT ATTTGCCAAC AAAATGGACA 150 
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AAATCGGTGC TGACTTCCTT TACTCTGTAA GCACACTTCA CGATCGTCTT 200 

CAAGCAAATG CACACCCAAT CCAATTGCCA ATCGGTTCTG AAGATGACTT 250 

CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 300 

ACCTTGGTAC AGATATCCTT GAAGAAGACA TCCCAGCTGA ATACCTTGAC 350 

5 CAAGCTCAAG AATACCGTGA AAAATTGATC GAAGCAGTTG CTGAAACTGA 400 

CGAAGAATTG ATGATGAAAT ACCTCGAAGG TGAAGAAATC ACTAACGAAG 4 50 

AATTGAAAGC TGGTATCCGT AAAGCGACTA TCAACGTTGA ATTCTTCCCA 500 

GTATTGTGTG GCTCTGCCTT CAAGAACAAA GGTGTTCAAT TGATGCTTGA 550 

TGCGGTTATT GACTACCTTC CAAGCCCACT TGACATCCCA GCGATCAAAG 600 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGATGAA 650 

GAGCCATTTG CAGCTCTTGC CTTCAAGATC ATGACAGACC CATTCGTAAG 700 

TCGTTTGACA TTCTTCCGTG TTTACTCARG TGKTCTTCAA TCAGGTTCAT 750 

ACGTATTGAA CACTTCTAAA GGKAAACGTG, AGCGTATCGG ACGTATCCTT 800 

CAAATGCACG CCAACAGCCG TCAAGAAATT GACACTGGTT ACTCAAGAGA 850 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TCCAACTGGT GACTCATTGC 900 

CAGATGAAAA AGCTAAAATC ATTCTTGAGT CAATCAACGT TCCAGAMCCA 950 

GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGCCAAGAT 1000 

GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 1050 

AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCTGGTAT GGGTGAGCTT 1100 

20 CACCTTGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTTGAAGC 1150 

AAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 1200 

CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 1250 

GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 1300 

ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 1350 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT ACTTGCAGGT 14 00 

TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 1450 

TGTCGACTCA TCTGAAACTG CCTTCAAGAT TGCGGCTTCA CTTGCTCTTA 1500 

AAGAAGCTGC TAAATCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 1550 

GTAACAATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA TGGGTCACGT 1600 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 1650 

AAATCGTTCG TGCTTACGT 1669 



35 2) INFORMATION FOR SEQ ID NO: 1581 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1581 

AATTGGGGAC TACACCTATT ATGATG 26 



50 



2) INFORMATION FOR SEQ ID NO: 1582 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1582 
GGCAAATCAG TCAGTTCAGG AGT 



2) INFORMATION FOR SEQ ID NO: 1583 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1583 
CGATTGGCAA CAATACACTC CTG 



2) INFORMATION FOR SEQ ID NO: 1584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1584 
TCACCTATTT TTACGCCTGG TAGGAC 



2) INFORMATION FOR SEQ ID NO: 1585 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (C) ACCESSION NUMBER: AF139725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1585 

ATGACTATAC CTGACGCAAA TGCAATCTAT CCTAACTCAG CCATCAAAGA 50 

10 GGTTGTCTTT ATCAAGAACG TGATCAAAAG TCCCAATATT GAAATTGGGG 100 

ACTACACCTA TTATGATGAC CCAGTAAATC CCACCGATTT TGAGAAACAC 150 

GTTACCCATC ACTATGAATT TCTAGGCGAC AAATTAATCA TCGGTAAATT 200 

TTGTTCTATC GCCAGTGGCA TTGAATTTAT CATGAACGGT GCCAACCACG 250 

TAATGAAAGG TATTTCGACT TATCCATTTA ATATTTTAGG TGGCGATTGG 300 

15 CAACAATACA CTCCTGAACT GACTGATTTG CCGTTGAAAG GTGATACTGT 350 

AGTCGGAAAT GACGTGTGGT< TTGGGCAAAA TGTGACCGTC CTACCAGGCG 4 00 

TAAAAATAGG TGACGGTGCC ATTATCGGAG CAAATAGTGT TGTAACAAAA 4 50 

GACGTCGCTC CATATACAAT TGTCGGTGGC AATCCAATTC AACTCATCGG 500 

ACCAAGATTT GAACCGGAAG TTATTCAAGC ATTAGAAAAT CTGGCATGGT 550 

20 GGAATAAAGA TATTGAATGG ATAACTGCTA ATGTTCCTAA ACTAATGCAA 600 

ACAACACCCA CACTTGAATT GATAAACAGT TTAATGGAAA AATAA 645 



25 2) INFORMATION FOR SEQ ID NO: 1586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1586 

ATTCCCACAA TCTTTTTTAT C AATAA 26 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1587 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1587 

CATTGTTCAG ATTCGGTAAA GTTC 24 
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2) INFORMATION FOR SEQ ID NO: 1588 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1588 
GTTTTTGAAG TTAAATAGTG TTCTT 25 



2) INFORMATION FOR SEQ ID NO: 1589 

(i) SEQUENCE CHARACTERISTICS-: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1589 
CTTCCATTTG TACTTTCCCT A 21 



2) INFORMATION FOR SEQ ID NO: 1590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cloning vector pFW16 
(C) ACCESSION NUMBER: U50983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1590 

ATGAAAATTA TTAATATTGG AGTTTTAGCT CATGTTGATG CAGGAAAAAC 50 

TACCTTAACA GAAAGCTTAT TATATAACAG TGGAGCGATT ACAGAATTAG 100 

GAAGCGTGGA CAAAGGTACA ACGAGGACGG ATAATACGCT TTTAGAACGT 150 

CAGAGAGGAA TTACAATTCA GACAGGAATA ACCTCTTTTC AGTGGGAAAA 200 

TACGAAGGTG AACATCATAG ACACGCCAGG ACATATGGAT TTCTTAGCAG 250 

823 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



AAGTATATCG TTCATTATCA 
GCAAAAGATG GCGTACAAGC 
GAAAATGGGG ATTCCCACAA 
GAATTGATTT ATCAACGGTT 
GAAATTGTAA TCAAACAGAA 
GAACTTTACC GAATCTGAAC 
ACCTTTTAGA GAAATATATG 
GAACAAGAGG AAAGCATAAG 
TCATGGAAGT GCAAAAAGTA 
TTACTAATAA ATTTTATTCA 
GGAAATGTTT TCAAAATTGA 
TATACGCCTT TATAGTGGAG 
CAGAAAAGGA AAAAATAAAA 
GAATTATGTA AGATTGATAG 



GTTTTAGATG 
ACAAACTCGT 
TCTTTTTTAT 
TATCAGGATA 
GGTAGAACTG 
AATGGGATAC 
TCCGGTAAAT 
ATTTCAGAAT 
ATATAGGGAT 
TCAACACATC 
ATATACAAAA 
TACTACATTT 
GTTACAGAAA 
AGCTTATTCT 



GGGCAATTCT ACTGATTTCT 
ATATTATTTC ATGCACTTAG 
CAATAAGATT GACCAAAATG 
TTAAAGAGAA ACTTTCTGCC 
TATCCTAATG TGTGTGTGAC 
GGTAATAGAG GGAAACGATG 
CATTAGAAGC ATTGGAACTC 
TGTTCTCTGT TCCCTCTTTA 
TGATAACCTT ATAGAAGTTA 
GAGGTCCGTC TGAACTTTGC 
AAAAGACAAC GTCTTGCATA 
ACGAGATTCG GTTAGAGTAT 
TGTATACTTC AATAAATGGT 
GGAGAAATTG TTATTTTGCA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 



10 



15 



20 



25 



30 



35 



40 



45 



AAATGAGTTT TTGAAGTTAA ATAGTGTTCT TGGAGATACA AAACTATTGC 1000 

CACAGAGAAA AAAGATTGAA AATCCGCACC CTCTACTACA AACAACTGTT 1050 

GAACCGAGTA AACCTGAACA GAGAGAAATG TTGCTTGATG CCCTTTTGGA 1100 

AATCTCAGAT AGTGATCCGC TTCTACGATA TTACGTGGAT TCTACGACAC 1150 

ATGAAATTAT ACTTTCTTTC TTAGGGAAAG TACAAATGGA AGTGATTAGT 1200 

GCACTGTTGC AAGAAAAGTA TCATGTGGAG ATAGAAATAA CAGAGCCTAC 1250 

AGTCATTTAT ATGGAGAGAC CGTTAAAAAA TGCAGAATAT ACCATTCACA 1300 

TCGAAGTGCC GCCAAATCCT TTCTGGGCTT CCATTGGTCT ATCTGTATCA 1350 

CCGCTTCCGT TGGGAAGTGG AATGCAGTAT GAGAGCTCGG TTTCTCTTGG 1400 

ATACTTAAAT CAATCATTTC AAAATGCAGT TATGGAAGGG ATACGCTATG 1450 

GTTGTGAACA AGGATTGTAT GGTTGGAATG TGACGGACTG TAAAATCTGT 1500 

TTTAAGTATG GCTTATACTA TAGCCCTGTT AGTACCCCAG CAGATTTTCG 1550 

GATGCTTGCT CCTATTGTAT TGGAACAAGT CTTAAAAAAA GCTGGAACAG 1600 

AATTGTTAGA GCCATATCTT AGTTTTAAAA TTTATGCGCC ACAGGAATAT 1650 

CTTTCACGAG CATACAACGA TGCTCCTAAA TATTGTGCGA ACATCGTAGA 1700 

CACTCAATTG AAAAATAATG AGGTCATTCT TAGTGGAGAA ATCCCTGCTC 1750 

GGTGTATTCA AGAATATCGT AGTGATTTAA CTTTCTTTAC AAATGGACGT 1800 

AGTGTTTGTT TAACAGAGTT AAAAGGGTAC CATGTTACTA CCGGTGAACC 1850 

TGTTTGCCAG CCCCGTCGTC CAAATAGTCG GATAGATAAA GTACGATATA 1900 

TGTTCAATAA AATAACTTAG 1920 



2) INFORMATION FOR SEQ ID NO: 1591 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1591 
ATGAGGTAAT AGAACGGATT 20 



2) INFORMATION FOR SEQ ID NO: 1592 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1592 
CAGTATTTCA GTAAGCGTAA A 



2) INFORMATION FOR SEQ ID NO: 1593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1593 
CCGAGCGATT TACCGGATAC TTGGCTGCIC GCTCGG 



2) INFORMATION FOR SEQ ID NO: 1594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: N97-330 

(C) ACCESSION NUMBER: AF175293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1594 

ATGTTTAGAA TTAAAGTTGC AGTTCTGTTT GGGGGCTGTT CAGAGGAACA 50 

TAATGTTTCG ATAAAATCTG CGATGGAGAT TGCCGCAAAC ATAGATACAA 100 

AAAAGTATCA GCCTTATTAT ATTGGAATCA CAAAATCCGG CGTTTGGAAA 150 

ATGTGTGAAA AACCTTGTTT GGAGTGGGAA CAATATGCGG GGGATCCGGT 200 

TGTTTTTTCA CCGGACAGAA GTACGCATGG TCTGCTGATA CAAAAAGACA 250 
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CTGGGTATGA AATCCAGCCT GTGGATGTGG GATTACCGAT GATTCATGGC 300 

AAGTTTGGCG AGGATGGCTC CATACAAGGC TTGCTTGAAT TGTCAGGCAT 350 

TCCGTATGTG GGATGCGATA TTCAAAGCTC CGTGACCTGC ATGGATAAGG 400 

CGCTTGCATA TACCGTTGTG AAAAATGCGG GTATCGCTGT GCCTGGGTTC 450 

5 CGGATCCTTC AGGAGGGGGA TCGCCTGGAA ACAGAGGATT TCGTATATCC 500 

CGTTTTTGTA AAGCCTGCCC GTTCCGGCTC ATCCTTTGGC GTAAACAAGG 550 

TATGCAAGGC AGAAGAACTG CAGGCAGCAA TCGAAGATGC AAGAAAATAT 600 

GACAGCAAGA TTTTGATTGA AGAGGCCGTT ACCGGGAGTG AGGTAGGCTG 650 

CGCCATACTG GGAAACGGAA ATGATCTCAT GGCTGGCGAG GTGGATCAGA 700 

10 TTGAGCTGAG ACACGGCTTT TTTAAGATTC ATCAGGAAGC ACAGCCGGAG 750 

AAAGGATCTG AAAATGCAGT CATCCGAGTT CCAGCCGCCT TACCGGATGA 800 

GGTAATAGAA CGGATTCAGA AAACAGCAAT GAAGATTTAC CGGATACTTG 850 

GCTGCAGAGG ATTGGCCCGC ATTGACCTGT TTTTGCGGGA GGACGGCTGC 900 

ATTGTGTTGA ATGAAGTGAA TACCATGCCG GGTTTTACTT CCTACAGCCG 950 

15 TTATCCCCGC ATGATGACAG CAGCCGGTTT TACGCTTACT GAAATACTGG 1000 

ATCGCTTGAT TGAACTTTCA CTTAGGAGGT AA 1032 



20 2) INFORMATION FOR SEQ ID NO: 1595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1595 

AAATAATGCT CCATCAATTT GCTGA 2 5 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1596 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1596 
ATAGTCGAAA AAGCCATCCA CAAG 24 

2) INFORMATION FOR SEQ ID NO: 1597 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1597 
GATGAATTTG CGAAAATACA TGGA 



2) INFORMATION FOR SEQ ID NO: 1598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1598 
CAGCCAATTT CTACCCCTTT CAC 



2) INFORMATION FOR SEQ ID NO: 1599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: BM4405 

(C) ACCESSION NUMBER: AF136925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1599 

CAGGGAGTAT TTGAGTTATT AGATATTCCA TATGTAGGTT GTGGTATCGG 50 

AGCTGCAGCA ATTTCCATGA ATAAAATAAT GCTCCATCAA TTTGCTGAAA 100 

TAGTTGGTGT AAAAAGCACT CCTAGTATGA TTATAGAAAA GGGACAAGAC 150 

CTACAAAAAG TCGATGAATT TGCGAAAATA CATGGATTTC CTTTATATAT 2 00 

CAAACCGAAT GAGGCAGGCT CATCAAAAGG AATTAGCAAG GTAGAACAAA 250 

AAAGTGATTT ATATAAAGCA ATAGACGAAG CTTCAAAATA TGATAGCCGT 300 
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ATTTTAATTC AAAAGGAAGT GAAAGGGGTA GAAATTGGCT GTGGGATTTT 
AGGGAATGAA CAATTGGTCG TTGGAGAATG TGATCAAATT AGTCTTGTGG 
ATGGCTTTTT CGACTATGAA GAGAAATACA ATTTAGTAAC AGCAGAAATT 
TTGTTACCAG CTAAACTATC AATAGACAAA AAAGAAGACA TCCAGATAAA 
AGCAAAAAAA CTATACAGAC TATTAGGGTG CAAAGGATTA GCGAGAATCG 
ACTTTTTCTT AACGGATGAC GGAGAAATTT TATTAAATGA GATCAACACC 
CTCC 
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2) INFORMATION FOR SEQ ID NO: 1600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN: ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1600 

ACCTCAAATT AATGAAGCAA TTGTTGTAAA TTTTGAAAGC GAAGGAAAAA 50 

AACATAAACT TGTTTTAGAA GTAGCAGCTC ATTTAGGAGA TAATAGAGTT 100 

AGAACTATTG CTATGGATAT GACAGATGGT TTGGTAAGGG GCTTAAAAGC 150 

TGAGGCTTTA GGTGCTCCTA TTAGTGTTCC TGTTGGTGAG AAAGTTTTAG 200 

GAAGAATTTT CAATGTTACT GGAGATTTGA TCGATGAAGG TGAAGAAATT 250 

CCTTTTGATA AAAAATGGGC AATTCATAGA GATCCGCCAG CTTTTGAAGA 300 

TCAAAGCACA AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTAGATT 350 

TACTTGCTCC TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA 400 

GGTGTTGGTA AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT 450 

TAAGCATAGC GGCTATTCTG TATTTGCAGG TGTGGGTGAG AGAACTCGTG 500 

AAGGAAATGA CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA 550 

GTTGCTCTAT GTTATGGACA AATGAATGAA CCACCAGGAG CAAGAAATCG 600 

TATTGCTTTA ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG 650 

GTCTTGATGT GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA 700 

GGTTCTGAAA TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA 750 

TCAACCAACC CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT 800 

CAACT 805 



2) INFORMATION FOR SEQ ID NO: 1601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1601 

CTCCTTACCA GATATCAACA ATGCGCTGGT TGTATATAAA AATGATGCCA 50 

ATAAAACAAA AGTTGTTTTA GAAGCAGCGT TAGAATTAGG TGACGGAATC 100 

ATTCGTGCGA TTGCAATGGA ATCGACTGAT GGATTGCAAC GTGGGATGGA 150 

AGTTGTCGAT ATGGGAGAAT CTATTTCTGT, ACCAGTTGGA ACAGAAACTT 200 

TAGGACGTGT GTTTAATGTA TTAGGAGATa' CGATTGACTT AGAAGCTCCT 250 

TTTCCTAAAG ATGCACCGCG TAGTGGAATC CATAAAAAAG CCCCTAATTT 300 

TGATGAATTA AGTACAAGCA CAGAAATTCT AGAAACAGGG ATCAAAGTCA 350 

TCGATCTATT AGCCCCTTAT TTAAAAGGTG GTAAAGTTGG ATTGTTTGGG 4 00 

GGAGCCGGTG TAGGAAAAAC GGTATTGATT CAAGAGTTGA TTCACAATAT 450 

CGCACAAGAA CACGGTGGGA TTTCGGTCTT TACTGGTGTT GGTGAGCGTA 500 

CACGTGAAGG AAACGATTTG TATTATGAAA TGAAAGATTC AGGAGTCATT 550 

GAAAAAACAG CGATGGTGTT TGGACAAATG AATGAGCCAC CTGGTGCACG 600 

GATGCGTGTA GCCTTGACTG GATTAACGAT TGCTGAATAT TTCCGTGATG 650 

TTGAAGGACA GGATGTACTG TTGTTTATCG ACAATATCTT CCGTTTTACA 700 

CAAGCTGGTT CTGAAGTGTC TGCCTTGTTA GGTCGTATGC CATCAGCTGT 750 

GGGATATCAA CCAACATTAG CTACCGAAAT GGGGCAATTG CAAGAGCGGA 800 

TCACGTCAAC GAAAAAAGGA TCGATC 826 



2) INFORMATION FOR SEQ ID NO: 1602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1602 

TGATACTTTA CCAGATATTA ATAATGCATT AGTAGTATAT AAAAAGGACG 50 

AGGACAAGAC ACGCGTTGTC TTAGAAGCCA CCTTGGAACT TGGAGATGGC 100 

ATGATTCGTG CAATCTCTAT GGGATCGACT GATGGCTTGC AACGGGGAAT 150 

GGAAGTTGTG GACACACAAG CACCTATTTC TGTTCCAGTA GGAAATGAAA 200 

CCTTAGGACG TGTTTTTAAT GTCTTAGGAG AAACGATTGA TAAACAGGCA 25 0 

CCGTTTCCTG AAGATGCCAA AAAAAGTGGT ATTCATAAAA AAGCACCCGC 300 

TTTTGATGAA TTAAGTACCA GTTCTGAAAT ATTGGAAACC GGGATTAAAG 350 

TAATCGATTT GCTAGCTCCT TATTTACGAG GTGGTAAAGT TGGATTATTT 400 
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GGCGGTGCTG GCGTGGGTAA 

CGTTGCCCAA GAACATGGGG 

GTACTCGTGA AGGAAATGAC 

ATTGAAAAAA CGGCTATGGT 

5 ACGTATGCGT GTAGCGTTAA 

ATGTACAAGG TCAAGACGTA 

ACTCAAGCAG GAACAGAAGT 

CGTTGGTTAC CAACCAACTC 

GAATCACATC GACAGATAAA 



AACGGTATTA ATTCAAGAAT 
GAATTTCTGT TTTTACGGGT 
CTATATTATG AAATGCAGGA 
ATTTGGACAA ATGAACGAAC 
CTGGTTTGAC ACTTGCTGAG 
TTATTATTTA TAGATAATAT 
ATCTGCTTTA TTAGGACGGA 
TAGCAACGGA AATGGGACAG 
GGATCAATTA CCT 
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TAATTCATAA 


450 


GTCGGAGAGC 


500 


TTCAGGCGTT 


550 


CCCCTGGTGC 


600 


TACTTCCGTG 


650 


TTTCCGCTTT 


700 


TGCCGTCTGC 


750 


TTGCAAGAAC 


800 




833 



2) INFORMATION FOR SEQ ID NO: 1603 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 
25 - (B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1603 

ATCAGCCTAA AATTAACGAA GCAATAGAAG TTAATTATGA ATTAGATGGT 50 

30 AAAAAAATAA GACTTATTCT TGAAGTAGCT GGACATCTTG GCGATAATAG 100 

AGCAAGAACC ATTGCTATGG ATATGAGTGA TGGTTTACAA AGAGGATTAG 150 

AAGTTACGGC TCTTGGTGCT CCTATAACAG TTCCTGTTGG AGATAAAGTT 2 00 

TTAGGTAGAA TGTTTAATGT TGTAGGTGAC TTAATAGATG AAGGTGAAGT 2 50 

AACAGATTTT GATAAAAGAT GGGCTATCCA TAGAGATCCT CCTTCGTTTG 300 

35 AAGATCAAAG TACAAAAAGT GAAATTTTTG AAACAGGTAT AAAAGTAGTT 350 

GATCTTCTTG CACCTTATGC AAAAGGTGGT AAAGTTGGCT TATTTGGTGG 4 00 

TGCTGGCGTT GGAAAAACAG TTATCATAAT GGAGCTTATA CATAATGTTG 4 50 

CATTTAAACA CAGCGGTTAT TCAATTTTTG CCGGTGTTGG AGAGAGAACA 500 

AGAGAGGGAA ATGATCTTTA TAATGAGATG AAAGAGTCTG GTGTTTTGGA 550 

4 0 TAAAGTTGCC TTATGTTATG GACAAATGAA TGAACCACCA GGAGCAAGAA 600 

ACCGTATAGC ATTAACAGGT CTTACAATGG CTGAATATTT CCGTGATGAA 650 

ATGGGGCTTG ATGTGTTGAT GTTTATAGAT AATATTTTTA GATTTTCTCA 700 

ATCAGGTTCT GAAATGTCAG CGCTGCTTGG TAGAATTCCA TCTGCTGTTG 750 

GTTATCAACC AACATTAGCA AGTGAGATGG GAAAACTTCA AGAAAGAATT 800 

45 ACTTCC 806 



2) INFORMATION FOR SEQ ID NO: 1604 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1604 



GGTGTTATCC GCACAATCGC 
GATGGAAGTT GTCGATACCG 
AAACATTAGG TCGTGTGTTT 
GCACCTTTTC CAGAAGATGT 
CGCTTTTGAA GACCTTAGCA 
AGGTTATCGA CTTATTAGCC 
TTCGGTGGTG CCGGTGTTGG 
TAATATCGCC CAAGAACACG 
AACGGACTCG TGAAGGGAAC 
GTTATTGAAA AAACAGCGAT 
TGCGCGGATG CGCGTTGCCT 
GTGATGAAGA AGGTCAAGAT 
TTCACACAAG CCGGATCAGA 
TGCCGTTGGT TATCAACCAA 
AACGAATCAC TTCAACGAAA 



TATGGAATCT ACAGATGGAT 
GCAAACCAAT CTCTGTTCCT 
AACGTATTAG GTGAAACGAT 
AGAAAAGAGC GGTATTCACA 
CCAGTAATGA GATTTTAGAA 
CCTTACTTAA AAGGTGGTAA 
TAAAACCGTC TTAATTCAAG 
GTGGGATTTC TGTCTTTACC 
GACCTTTATT ATGAAATGAA 
GGTCTTCGGA CAAATGAATG 
TGACTGGTTT GACATTAGCT 
GTGTTGCTAT TTATCGATAA 
AGTTTCGGCG CTATTAGGTC 
CCTTGGCAAC AGAAATGGGT 
AAAGGCTCAA TTACATCG 



TGCAACGGGG 50 

GTAGGTAAAG 100 

CGACAAAGAA 150 

AAAAGGCCCC 200 

ACTGGGATCA 250 

AGTTGGACTA 300 

AACTGATTCA 350 

GGGGTTGGGG 400 

AGAATCCGGC 450 

AGCCACCAGG 500 

GAATATTTCC 550 

CATTTTCCGC 600 

GGATGCCATC 650 

CAATTACAAG 700 
738 



2) INFORMATION FOR SEQ ID NO: 1605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1605 
ATYATYGAAR TITAYGCICC 



2) INFORMATION FOR SEQ ID NO: 1606 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1606 
5 CCRAACATIA YICCIACTTT TTC 23 



10 



2) INFORMATION FOR SEQ ID NO: 1607 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1607 



25 


CTGGATTATG 


CGATGTCGGT 


CATTGTTGGC 


CGTGCGCTGC 


CGGATGTCCG 


50 




AGATGGCCTG 


AAACCGGTAC 


ACCGTCGCGT 


ACTTTACGCC 


ATGAACGTAT 


100 




TGGGCAATGA 


CTGGAACAAA 


GCCTATAAAA 


AATCCGCCCG 


TGTCGTTGGT 


150 




GACGTAATCG 


GTAAATACCA 


CCCTCATGGT 


GATACCGCCG 


TTTATGACAC 


200 




CATTGTACGT 


ATGGCACAGC 


CATTCTCCTT 


GCGTTATATG 


CTGGTCGATG 


250 


30 


GCCAGGGTAA 


CTTCGGTTCT 


GTCGATGGCG 


ACTCCGCCGC 


AGCGATGCGT 


300 




TATACGGAAA 


TCCGTATGTC 


GAAAATCGCC 


CACGAG 




336 



35 2) INFORMATION FOR SEQ ID NO: 1608 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1608 

CTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG CTGCCGGATG 50 
TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTATA CGCCATGAAC 100 
GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCTG CCCGTGTCGT 150 
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GGGTGACGTC ATCGGTAAAT ACCACCCTCA TGGTGATACT GCCGTATACG 
ACACCATTGT ACGTATGGCG CAGCCATTCT CCCTGCGTTA CATGCTGGTA 
GATGGCCAGG GTAACTTTGG TTCGGTCGAC GGCGACTCCG CCGCAGCGAT 
GCGTTATACG GAAATCCGTA TGTCGAAGAT CGCCCATGAA C 



200 
250 
300 
341 



2) INFORMATION FOR SEQ ID NO: 1609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1609 
GCCCTGATCC AAATAGCATA TA 22 



2) INFORMATION FOR SEQ ID NO: 1610 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1610 
CCTGGCATAA CAGTAACATT CTG 23 



2) INFORMATION FOR SEQ ID NO: 1611 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1611 
TGGGAAAAAG CAACTCCATC TC 22 
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2) INFORMATION FOR SEQ ID NO: 1612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1612 
ACAACTGAAT TCGCAGCAAC AAT 



2) INFORMATION FOR SEQ ID NO: 1613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: U19459 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1613 

ATGAAATATG GCCCTGATCC AAATAGCATA TATCCACATG AAGAAATAAA 50 

AAGTGTTTGT TTTATTAAAA ATACAATTAC CAATCCAAAT ATTATAGTTG 100 

GAGATTATAC TTACTATTCC GATGTTAACG GAGCTGAAAA ATTTGAAGAA 150 

CATGTGACAC ATCATTATGA ATTTAGGGGT GATAAACTTG TAATTGGCAA 200 

GTTTTGTGCA ATAGCTGAAG GTATAGAATT TATTATGAAT GGAGCAAACC 250 

ATAGAATGAA TTCAATAACA ACTTATCCTT TTAATATAAT GGGAAATGGT 300 

TGGGAAAAAG CAACTCCATC TCTTGAAGAT TTACCATTTA AGGGAGATAC 350 

TGTTGTTGGA AATGATGTGT GGATTGGTCA GAATGTTACT GTTATGCCAG 4 00 

GAATTCAAAT AGGAGATGGA GCAATTGTTG CTGCGAATTC AGTTGTTACA 4 50 

AAAGATGTAC CACCATATCG TATTATTGGT GGAAATCCGA GTAGAATTAT 500 

AAAGAAAAGG TTTGAAGATG AATTGATAGA TTATTTATTG CAAATAAAAT 550 

GGTGGGATTG GTCAGCACAA AAAATATTTT CTAATCTTGA AACACTTTGT 600 

AGCTCTGATT TAGAGAAAAT AAAATCTATT CGAGATTAG 639 



2) INFORMATION FOR SEQ ID NO: 1614 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1614 
CCAATCCAGA AGAAATATAC CC 



2) INFORMATION FOR SEQ ID NO: 1615 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1615 
ATTAGTTTAT CCCCAATCAA TTCA 



2) INFORMATION FOR SEQ ID NO: 1616 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1616 
ATAATGAATG GGGCTAATCA TCGTAT 



2) INFORMATION FOR SEQ ID NO: 1617 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1617 
GCCAACAACT GAATAAGGAT CAAC 24 

5 

2) INFORMATION FOR SEQ ID NO: 1618 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
(C) ACCESSION NUMBER: AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1618 



20 



ATGAAATGGC AAAATCAGCA AGGCCCCAAT CCAGAAGAAA TATACCCTAT 50 

AGAAGGTAAT AAACATGTTC AATTTATTAA ACCATCTATA ACAAAGCCCA 100 

25 ATATTTTAGT TGGGGAATAT TCATATTACG ATAGTAAAGA TGGTGAATCT 150 

TTTGAAAGCC AAGTTCTTTA TCACTATGAA TTGATTGGGG ATAAACTAAT 200 

ATTAGGGAAG TTTTGTTCTA TTGGACCCGG AACGACATTT ATAATGAATG 250 

GGGCTAATCA TCGTATGGAT GGTTCAACAT TTCCATTCAA TCTTTTCGGA 300 

AATGGTTGGG AGAAGCATAC CCCTACATTG GAAGACCTTC CTTATAAGGG 350 

30 TAACACGGAA ATTGGGAACG ATGTTTGGAT TGGACGAGAT GTGACAATTA 4 00 

TGCCCGGTGT AAAAATAGGA AACGGGGCTA TTATTGCAGC AAAATCGGTT 450 

GTGACAAAGA ACGTTGATCC TTATTCAGTT GTTGGCGGTA ATCCTTCACG 500 

ATTAATTAAG ATAAGGTTTT CCAAGGAAAA AATCGCAGCA TTACTAAAAG 550 

TAAGGTGGTG GGACCTAGAG ATAGAGACGA TAAATGAAAA TATTGATTGC 600 

35 ATCCTGAATG GTGATATAAA AAAGGTTAAA AGAAGTTAG 639 



40 



2) INFORMATION FOR SEQ ID NO: 1619 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases ■ 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1619 

50 

AAGGCAAAAT AAAAGGAGCA AAGC 24 
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2) INFORMATION FOR SEQ ID NO: 1620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1620 
TGTACCCGAG ACATCTTCAC CAC 



2) INFORMATION FOR SEQ ID NO: 1621 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1621 
AATTGAAGGA CGGGTATTGT GGAAAG 



2) INFORMATION FOR SEQ ID NO: 1622 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1622 
CGATTTTGAC AGATGGCGAT AATGAA 



2) INFORMATION FOR SEQ ID NO: 1623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M90056 

10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1623 

ATGAAAATAA TGTTAGAGGG ACTTAATATA AAACATTATG TTCAAGATCG 
TTTATTGTTG AACATAAATC GCCTAAAGAT TTATCAGAAT GATCGTATTG 
GTTTAATTGG TAAAAATGGA AGTGGAAAAA CAACGTTACT TCACATATTA 

15 TATAAAAAAA TTGTGCCTGA AGAAGGTATT GTAAAACAAT TTTCACATTG 
TGAACTTATT CCTCAATTGA AGCTCATAGA ATCAACTAAA AGTGGTGGTG 
AAGTAACACG AAACTATATT CGGCAAGCGC TTGATAAAAA TCCAGAACTG 
CTATTAGCAG ATGAACCAAC AACTAACTTA GATAATAACT ATATAGAAAA 
ATTAGAACAG GATTTAAAAA ATTGGCATGG AGCATTTATT ATAGTTTCAC 

20 ATGATCGCGC TTTTTTAGAT AACTTGTGTA CTACTATATG GGAAATTGAC 
GAGGGAAGAA TAACTGAATA TAAGGGGAAT TATAGTAACT ATGTTGAACA 
AAAAGAATTA GAAAGACATC GAGAAGAATT AGAATATGAA AAATATGAAA 
AAGAAAAGAA ACGATTGGAA AAAGCTATAA ATATAAAAGA ACAGAAAGCT 
CAACGAGCAA CTAAAAAACC GAAAAACTTA AGTTTATCTG AAGGCAAAAT 

25 AAAAGGAGCA AAGCCATACT TTGCAGGTAA GCAAAAGAAG TTACGAAAAA 
CTGTAAAATC TCTAGAAACC AGACTAGAAA AACTTGAAAG CGTCGAAAAG 
AGAAACGAAC TTCCTCCACT TAAAATGGAT TTAGTGAACT TAGAAAGTGT 
AAAAAATAGA ACTATAATAC GTGGTGAAGA TGTCTCGGGT ACAATTGAAG 
GACGGGTATT GTGGAAAGCA AAAAGTTTTA GTATTCGCGG AGGAGACAAG 

30 ATGGCAATTA TCGGATCTAA TGGTACAGGA AAGACAACGT TTATTAAAAA 
AATTGTGCAT GGGAATCCTG GTATTTCATT ATCGCCATCT GTCAAAATCG 
GTTATTTTAG CCAAAAAATA GATACATTAG AATTAGATAA GAGCATTTTA 
GAAAATGTTC AATCTTCTTC ACAACAAAAT GAAACTCTTA TTCGAACTAT 
TCTAGCTAGA ATGCATTTTT TTAGAGATGA TGTTTATAAA CCAATAAGTG 

35 TCTTAAGTGG TGGAGAGCGA GTTAAAGTAG CACTAACTAA AGTATTCTTA 
AGTGAAGTTA ATACGTTGGT ACTAGATGAA CCAACAAACT TTCTTGATAT 
GGAAGCTATA GAGGCGTTTG AATCTTTGTT AAAGGAATAT AATGGCAGTA 
TAATCTTTGT ATCTCACGAT CGTAAATTTA TCGAAAAAGT AGCCACTCGA 
ATAATGACAA TTGATAATAA AGAAATAAAA ATATTTGATG GCACATATGA 

40 ACAATTTAAA CAAGCTGAAA AGCCAACAAG GAATATTAAA GAAGATAAAA 
AACTTTTACT TGAGACAAAA ATTACAGAAG TACTCAGTCG ATTGAGTATT 
GAACCTTCGG AAGAATTAGA ACAAGAGTTT CAAAACTTAA TAAATGAAAA 
AAGAAATTTG GATAAATAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1569 
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2) INFORMATION FOR SEQ 



ID NO: 1624 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1624 
TTCTTTAATG CTCGTAGATG AACCTA 



2) INFORMATION FOR SEQ ID NO: 1625 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1625 
TTTTCGTATT CTTCTTGTTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1626 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1626 
AGGAATGATT AAGCCCCCTT CAAAAA 



2) INFORMATION FOR SEQ ID NO: 1627 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

{ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1627 
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2) INFORMATION FOR SEQ ID NO: 1628 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15- (A) ORGANISM: Staphylococcus aureus 

(C) ACCESSION. NUMBER: U82085 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1628 

2 0 ATGCTTAAAA TCGACATGAA GAATGTAAAA AAATATTATG CAGATAAATT 50 

AATTTTAAAT ATAAAAGAAC TAAAGATTTA TAGTGGGGAT AAAATAGGTA 100 

TTGTAGGTAA GAATGGAGTT GGCAAAACAA CACTTTTAAA AATAATAAAA 150 

GGACTAATAG AGATTGACGA AGGAAATATA ATTATAAGTG AAAAAACAAC 200 

TATTAAATAT ATCTCTCAAT TAGAAGAACC ACATAGTAAG ATAATTGATG 250 

25 GAAAATATGC TTCAATATTT CAAGTTGAAA ATAAGTGGAA TGACAATATG 300 

AGTGGTGGTG AAAAAACTAG ATTTAAACTA GCAGAGGGAT TTCAAGATCA 350 

ATGTTCTTTA ATGCTCGTAG ATGAACCTAC AAGTAATTTA GATATCGAAG 400 

GAATAGAGTT GATAACAAAT ACTTTTAAAG AGTACCGTGA TACTTTTTTG 4 50 

GTAGTATCTC ATGATAGAAT TTTTTTAGAT CAAGTTTGTA CAAAAATTTT 500 

3 0 TGAAATTGAA AATGGATATA TTAGAGAATT CATCGGTAAT TATACAAACT 550 

ATATAGAGCA AAAAGAAATG CTTCTACGAA AGCAACAAGA AGAATACGAA 600 

AAGTATAATT CTAAAAGAAA GCAATTGGAG CAAGCTATAA AGCTAAAAGA 650 

GAATAAGGCG CAAGGAATGA TTAAGCCCCC TTCAAAAACA ATGGGAACAT 700 

CTGAATCTAG AATATGGAAA ATGCAACATG CTACTAAACA AAAAAAGATG 750 

35 CATAGAAATA CGAAATCGTT GGAAACACGA ATAGATAAAT TAAATCATGT 800 

AGAAAAAATA AAAGAGCTTC CTTCTATTAA AATGGATTTA CCTAATAGAG 850 

AGCAATTTCA TGGTCGCAAT GTAATTAGTT TAAAAAACTT ATCTATAAAA 900 

TTTAATAATC AATTTCTTTG GAGAGATGCT TCATTTGTCA TTAAAGGTGG 950 

AGAAAAGGTT GCTATAATTG GTAACAATGG TGTAGGAAAA ACAACATTGT 1000 

40 TGAAGCTGAT TCTAGAAAAA GTAGAATCAG TAATAATATC ACCATCAGTT 1050 

AAAATTGGAT ACGTCAGTCA AAACTTAGAT GTTCTACAAT CTCATAAATC 1100 

TATCTTAGAA AATGTTATGT CTACCTCCAT TCAAGATGAA ACAATAGCAA 1150 

GAATTGTTCT AGCAAGATTA CATTTTTATC GCAATGATGT TCATAAAGAA 1200 

ATAAATGTTT TGAGTGGTGG AGAACAAATA AAGGTTGCTT TTGCCAAGCT 1250 

45 ATTTGTTAGC GATTGTAATA CATTAATTCT TGATGAACCA ACAAACTATT 1300 

TGGATATCGA TGCTGTTGAG GCATTAGAAG AATTGTTAAT TACCTATGAA 1350 

GGTGTTGTGT TATTTGCTTC CCATGATAAA AAATTTATAC AAAACCTAGC 1400 

TGAACAATTG TTAATAATAG AAAATAATAA AGTGAAAAAA TTCGAAGGAA 14 50 

CATATATAGA ATATTTAAAA ATTAAAGATA AACCAAAATT AAATACAAAT 1500 

50 GAAAAAGAAC TCAAAGAAAA AAAGATGATA CTAGAAATGC AAATTTCATC 1550 

ATTATTAAGT AAAATCTCAA TGGAAGAAAA TGAAGAAAAA AACAAAGAAT 1600 

TAGATGAAAA GTACAAATTG AAATTAAAAG AATTGAAAAG CCTAAATAAA 1650 

AATATTTA 1658 
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2) INFORMATION FOR SEQ ID NO: 1629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1629 
AAGGGGAAAG TTTGGATTAC ACAACA 



2) INFORMATION FOR SEQ ID NO: 1630 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1630 
GAACCACAGG GCATTATCAG AACC 



2) INFORMATION FOR SEQ ID NO: 1631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1631 
CGACGATGCT TTATGGTTTG T 



2) INFORMATION FOR SEQ ID NO: 1632 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1632 
GTTAATTTGC CTATCTTGTC ACACTC 26 

2 ) INFORMATION FOR SEQ ID NO: 1633 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M36022 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1633 

ATGGAATTTA AATTACAAGA ATTAAATCTT ACTAACCAAG ATACAGGACC 50 

ATATGGTATA ACCGTTTCAG ATAAGGGGAA AGTTTGGATT ACACAACATA 100 

AAGCAAATAT GATAAGTTGC ATCAATTTAG ATGGAAAAAT TACAGAGTAC 150 

30 CCACTACCGA CACCAGATGC AAAAGTCATG TGTTTAACTA TATCCTCAGA 200 

TGGGGAAGTT TGGTTTACTG AGAATGCAGC AAACAAAATA GGGAGGATTA 250 

CAAAAAAAGG GATTATTAAG GAATATACAT TGCCTAACCC AGATTCAGCA 300 

CCCTACGGTA TTACAGAAGG ACCAAATGGA GATATATGGT TTACAGAAAT 350 

GAATGGCAAC CGTATTGGAC GTATTACGGA CGACGGTAAA ATTCGTGAAT 400 

35 ACGAGCTGCC TAATAAAGGA TCTTACCCTT CTTTTATCAC TTTGGGTTCT 450 

GATAATGCCC TGTGGTTCAC AGAAAATCAA AATAATGCTA TTGGTAGAAT 500 

TACAGAAAGT GGGGATATTA CAGAGTTTAA AATTCCTACA CCTGCATCAG 550 

GACCAGTTGG TATTACAAAG GGGAACGACG ATGCTTTATG GTTTGTGGAA 600 

ATTATCGGTA ATAAGATAGG GCGAATAACT CCTCTGGGGG AAATTACCGA 650 

4 0 ATTCAAAATT CCAACGCCAA ACGCTCGACC TCATGCAATT ACTGCTGGAG 700 

CAGGAATTGA TTTATGGTTT ACTGAATGGG GGGCTAATAA AATAGGAAGG 750 

CTGACAAGCA ATAATATAAT TGAGGAATAC CCAATTCAAA TCAAAAGTGG 800 

TGAACCACAT GGCATTTGTT TCGATGGTGA AACAATTTGG TTTGCAATGG 850 

AGTGTGACAA GATAGGCAAA TTAACTCTCA TTAAGGATAA TATGGAGTGA 900 

45 



2) INFORMATION FOR SEQ ID NO: 1634 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1634 
TTAACTTGTC TATTCCCGAT TCAGG 



2) INFORMATION FOR SEQ ID NO: 1635 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1635 
GCTGTGGCAA TGGATATTCT GTA 



2) INFORMATION FOR SEQ ID NO: 1636 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1636 
TTCCTACCCC TGATGCTAAA GTGA 



2) INFORMATION FOR SEQ ID NO: 1637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1637 
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CAAAGTGCGT TATCCGAACC TAA 



2) INFORMATION FOR SEQ ID NO: 1638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ 

GGTGGGGAAG ACTGTCTTCA TCCAGGAATT 
GCTCACCTAG TTTTCGCTCG ATCTTTTCAC 
AACATTGCCA AGGCCCACGG TGGTTACTCC 
GCGTACTCGT GAGGGTAACG ATCTGTACCA 
TCATTCAGCT CGAGGGTGAA TCCAAGGTCG 
AACGAGCCCC CCGGTGCCCG TGCCCGTGTC 
TGCCGAGTAC TTCCGTGACG AGGAGGGTCA 
ACAACATTTT CCGTTTCACC CAGGCCGGTT 
GGTCGTATCC CCTCTGCCGT CGGTTACCAG 
GGGTGGTATG CAGGAGCGTA TCACCACCAC 
CCGTCCARGC CGTCTACGTC CCCGCGA 



fumigatus 



ID NO: 1638 

GATTGTGAGT CGTTCCACAT 50 

TAACGCAAAC CATGTAGAAC 100 

GTCTTCACTG GTGTTGGTGA 150 

CGAAATGCAG GAGACTGGTG 200 

CACTGGTGTT CGGACAGATG 250 

GCCCTTACCG GTCTGACCAT 300 

GGACGTGCTG CTCTTCATTG 350 

CTGAGGTGTC TGCCCTTCTC 400 

CCCACCCTGG CCGTCGACAT 4 50 

CAAGAAGGGT TCTATTACCT 500 

527 



2) INFORMATION FOR SEQ ID NO: 1639 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 52 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1639 

ACTGTCTTGA TTCAAGAACT TATTAACAAC ATTGCTAAAG CCCATGGTGG 50 
TTACTCTATT TTCTGTGGTG TTGGTGAACG TACTCGTGAA GGTAACGATT 100 
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TATACCACGA AATGATGGAA ACTGGTGTCA 
AAGTGTGCTC TTGTATTCGG TCAAATGAAC 
CCGTGTTGCT TTAACTGGTT TAACCATTGC 
AAGGTCAAGA TGTGTTACTT TTCATTGATA 
5 GCTGGTTCTG AAGTATCTGC CCTTTTAGGT 
TTACCAACCC ACTTTATCTA CTGATATGGG 
CTACTACCAA GAATGGTTCC ATTACCTCTG 
GC 



TTAAACTTGA AGGTGACTCC 150 

GAACCTCCTG GTGCTCGTGC 200 

TGAATACTTC CGTGATGAAG 250 

ACATTTTCCG TTTCACTCAA 300 

CGTATTCCAT CTGCTGTAGG 350 

TGGTATGCAA GAACGTATTA 400 

TACARGCCGT CTACGTCCCC 450 

452 



2) INFORMATION FOR SEQ ID NO: 1640 

(i) SEQUENCE CHARACTERISTICS: ' 
15 (A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 
25 (C) ACCESSION NUMBER: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1640 



ACGCCCTTAC GGTAAAACAA AGCAACGAAA ACGGAAGCAT GAACTTAACA 50 

30 TTTGAAGTTG CACTTCATTT AGGTGATGAT ACAGTTCGTA CAGTTGCGAT 100 

GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA GATACTGGTA 150 

AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG TGTATTCAAC 200 

GTATTAGGTG ATGCAATTGA CTTAGATGGT GAACTTCCTG CGGATGTACA 250 

CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA TTATCTACTA 300 

35 AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT ACTTGCTCCT 350 

TACATTAAGG GTGGTAAGAT CGGCCTATTC GGTGGTGCCG GCGTAGGTAA 4 00 

AACAGTATTA ATTCAAGAAT TAATTAACAA CATCGCACAA GAGCACGGTG 450 

GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA GGGTAATGAC 500 

TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA CTGCGATGGT 550 

4 0 ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT GTTGCATTAA 600 

CAGGTTTAAC AATGGCTGAG CATTTCCGTG ATGAGCAAGG ACAAGACGTA 650 

CTTCTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG GTTCTGAAGT 700 

ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC CAACCAACAC ' 750 

TTGCAACAGA AATGGGTCAA TTACAAGAGC GTA 783 

45 



2) INFORMATION FOR SEQ ID NO: 1641 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL NRS-319 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1641 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG / TTCGTGGCAC AGAAGTAGAA 150 

GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG 200 

15 TGTATTCAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA 300 

CTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 4 00 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

20 GAACACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

GGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT 600 

GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCAG 700 

25 GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 750 

CAACCAACAC TTGCAACAGA AATGGGTCAA TTACAAGAGC GTATTACATC 800 

TACAAATAAA GGATCTATCA CGT 823 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1642 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL BD-15 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1642 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAAAG 50 

CAACGAAAAC GGAGCAAACA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA TACAGTTCGT ACAGTTGCGA TGTCTTCCAC AGATGGACTT 150 

50 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGTA ACACTTGGTC GTGTATTCAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAAGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC . 350 
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TGGTATTAAA 
TTGGTCTATT 
TTAATTAACA 
TGTAGGTGAG 
ATTCTGGCGT 
CCACCTGGAG 
GCATTTCCGT 
TCTTCCGTTT 
ATGCCATCTG 
ATTACAAGAG 



GT AG TAG ACT 
CGGTGGTGCC 
ACATCGCACA 
CGTACTCGTG 
AATTAAGAAA 
CACGTCAACG 
GATGAGCAAG 
CACGCAAGCA 
CGGTAGGTTA 
CGTATTACAT 



TACTTGCTCC TTACATTAAG GGTGGTAAGA 
GGTGTAGGTA AAACAGTATT AATTCAGGAA 
AGAACACGGT GGTATCTCTG TATTCGCCGG 
AAGGTAACGA CTTATACCAC GAAATGAGCG 
ACTGCGATGG TATTCGGACA AATGAACGAG 
TGTTGCATTA ACAGGTTTAA CAATGGCTGA 
GACAAGACGT ACTACTGTTC ATCGATAACA 
GGTTCTGAAG TATCTGCCCT TCTTGGTCGT 
CCAGCCAACA CTTGCAACAG AAATGGGTCA 
CTACAAATA 



400 
450 
500 
550 
600 
650 
700 
750 
800 
829 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1643 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1643 

AATCTACAAT GCCCTTACGG TAAAACAAAG CAACGAAAAC GGAGCAAGCA 50 

TTAACTTAAC ATTTGAAGTT GCACTTCATT TAGGTGATGA CACAGTTCGT 100 

ACAGTTGCGA TGTCTTCCAC AGATGGACTT GTTCGTGGCA CAGAAGTAGA 150 

AGATACTGGT AAACCAATCT CTGTACCGGT TGGTGATGCA ACACTTGGTC 200 

GTGTATTTAA CGTATTAGGT GATGCAATTG ACTTAGATGG TGAAGTTCCT 250 

GCGGATGTAC GCCGTGATCC AATTCACCGT CAAGCACCTG CGTTCGAAGA 300 

GTTATCTACG AAAGTAGAAA TTCTTGAAAC TGGTATTAAA GTAGTAGACT 350 

TACTTGCTCC TTACATTAAA GGTGGTAAAA TCGGTCTATT CGGTGGTGCC 400 

GGTGTAGGTA AAACAGTATT AATCCAGGAA TTAATTAACA ACATCGCACA 450 

AGAGCACGGT GGTATTTCTG TATTCGCTGG TGTAGGTGAG CGTACTCGTG 500 

AAGGTAATGA CTTATACCAC GAAATGAGCG ATTCTGGCGT AATCAAGAAA 550 

ACAGCGATGG TATTCGGACA AATGAACGAG CCACCTGGTG CACGTCAACG 600 

TGTTGCATTA ACAGGATTAA CAATGGCTGA ACATTTCCGT GATGAGCAAG 650 

GACAAGACGT ACTATTGTTC ATCGATAACA TCTTCCGTTT CACGCAAGCG 700 

GGTTCTGAAG TATCTGCCCT TCTTGGTCGT ATGCCATCTG CGGTAGGTTA 750 

CCAACCAACT CTTGCAACAG AAATGGGTCA ATTACAAGAG CGTATTACAT 800 

CTACAAATAA AGGATCTATC ACG 823 



2) INFORMATION FOR SEQ ID NO: 1644 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 708 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1644 

GGAAAGCTAC CTGAAATCTA CAATGCCCTT, AGAATTAAAC AAGACGCAGT 50 

TAACTTAACT TTAGAAGTTG CACTTCACTT AGGTGATGAT ACAGTTCGTA 100 

CAGTTGCGAT GTCTTCCACA GACGGACTTG TTCGTGGTAC TGCAGTAGAA 150 

GATACTGGCA AAGCGATTTC TGTTCCAGTT GGTGATGCAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GAGGTTCCAG 250 

CAGATGTACG CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAG 300 

TTATCTACAA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAAG GTGGTAAAAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 450 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGTGC ACGTCAACGT 600 

GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 700 

GTTCTGAA 708 



2) INFORMATION FOR SEQ ID NO: 1645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1645 

GTACGATGCG CTTGAAGTTC AAAACGATGT CAAACTAGTG CTGGAAGTTC 50 

AACAGCAGCT CGGTGGTGGT GTCGTTCGTT GTATCGCTAT GGGTACCTCA 100 

GATGGCCTAA GCCGCGGTCT GAAAGTACTT GATTTAGAAC ACCCTATCGA 150 

AGTTCCTGTC GGTGTGGCAA CTCTGGGCCG TATTATGAAC GTGCTTGGTG 200 

AGCCTATCGA TATGAAGGGT GACATTGGTG AAGAAGAGCG TTGGGCTATC 250 

CATCGCGCTG CACCAAGCTA CGAAGATTTA TCCAGTGCTA ACGAACTGCT 300 

GGAAACGGGC ATCAAGGTTA TCGACCTGAT TTGTCCGTTT GCCAAAGGCG 350 
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5 AATGAGCCAC CGGGAAACCG 
GGCGGAAAAA TTCCGTGATG 
ACATTTATCG TTATACCTTA 
CGTATGCCAT CAGCGGTAGG 
TACACTGCAA GAACGTATCA 



GGTGCCGGCG TAGGTAAAAC 
TGCGACTGAG CACTCAGGTT 
CTCGTGAGGG TAATGACTTC 
GACAAAGTAT CTCTGGTTTA 
TCTGCGCGTT GCGTTAACCG 
AAGGCCGTGA CGTTCTGCTG 
GCCGGTACAG AAGTATCAGC 
TTACCAACCA ACGCTGGCGG 
CYTCAACC 
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GGT AAACATG 400 

ACTCTGTATT 450 

TACCACGAAA 500 

TGGCCAAATG 550 

GCCTGACCAT 600 

TTTATCGATA 650 

ACTGTTAGGT 700 

AAGAGATGGG 750 
778 



15 



20 



2) INFORMATION FOR SEQ ID NO: 164 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 6 



GATGCCGTAC 

30 CCGTCTGGTG 
CTATCGCCAT 
AACCTCGATC 
TATCATGAAC 
AAGAAGAGCG 

35 TCTAGCTCCC 
TTGCCCGTTC 
TGGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 

40 CACTGGTTTA 
GCGTTGACCG 
CGTTCTGCTG 
AAGTATCTGC 
ACTCTGGCAG 

45 AACTGG 



CAAAAGTGTA 
CTGGAAGTTC 
GGGTACTTCT 
ACCCAATTGA 
GTCCTGGGCG 
TTGGGCGATT 
AGGATCTGCT 
GCTAAGGGTG 
TGTAAACATG 
ACTCAGTGTT 
TACCATGAAA 
TGGCCAGATG 
GTCTGACCAT 
TTCGTTGATA 
GCTGCTGGGT 
AAGAGATGGG 



CGACGCTCTT 
AGCAACAGTT 
GATGGTTTGC 
AGTGCCAGTA 
AGCCTATCGA 
CACCGTGCTG 
GGAAACCGGC 
GTAAAGTCGG 
ATGGAGCTGA 
TGCAGGCGTG 
TGACCGATTC 
AACGAGCCAC 
GGCTGAGAAG 
ACATTTACCG 
CGTATGCCAT 
TGTTTTGCAG 



GAGGTTACAA 
AGGTGGTGGC 
GTCGTGGTCT 
GGTAAAGCAA 
CATGAAAGGC 
CTCCTAGCTA 
ATCAAAGTAA 
TCTGTTCGGT 
TCCGTAACAT 
GGGGAACGTA 
CAACGTTCTG 
CAGGTAACCG 
TTCCGTGACG 
TTATACCCTG 
CTGCGGTAGG 
GAGCGTATTA 



ATGGTAAAGA 
GTAGTGCGTA 
GGAAGTTTCT 
CTCTGGGCCG 
GATATCGGCG 
CGAAGAACTG 
TGGACCTGAT 
GGTGCGGGTG 
CGCGATCGAG 
CTCGTGAGGG 
GACAAAGTAT 
TCTGCGCGTT 
AAGGTCGTGA 
GCCGGTACAG 
TTACCAGCCA 
CCTCCACCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 1647 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1647 

GAAACCGTTG TTGACACTGG TTCCCCAATC ACCGTCCCAG TTGGTCGTGA 50 

AACCTTGGGT CGTATCATTA ACGTTATCGG, TGAACCAATC GATGAACGTG 100 

GTCCAATCAA CACCAAGCAA AGAAACCCAa' TCCATGCTGA TCCTCCTTCA 150 

TTTGCTGAAC AATCCACTTC TGCTGAAGTT TTAGAAACCG GTATTAAAGT 200 

TGTCGATTTA TTGGCTCCTT ATGCCAGAGG TGGTAAAATT GGTTTATTCG 250 

GTGGTGCCGG TGTCGGTAAA ACCGTCTTTA TCCAAGAATT GATTAACAAC 300 

ATTGCTAAAG CTCATGGTGG TTTCTCAGTC TTCACCGGTG TCGGTGAAAG 350 

AACCAGAGAA GGTAACGATT TATACCGTGA AATGAAAGAA ACCGGTGTCA 4 00 

TTAACTTGGA AGGTGACTCT AAAGTCGCTT TAGTCTTCGG TCAAATGAAC 4 50 

GAACCTCCAG GTGCTAGAGC CCGTGTTGCC TTAACCGGTC TAACCATTGC 500 

TGAATACTTC AGAGATGAAG AAGGTCAAGA TGTCTTGTTA TTCGTTGATA 550 

ACATTTTCAG ATTCACCCAA GCTGGTTCAG AAGTCTCTGC CTTATTAGGT 600 

CGTATTCCAT CTGCTGTCGG TTATCAACCA ACCTTGGCCA CTGATATGGG 650 

TTTGTTGCAA GAACGTATCA CCACCACCAG AAAAGGTTCC GTCACTTCTG 700 

TCCAAGCTGT TTATGTCCCA GCTGATGATT TGACTGATCC TGCCCCAGCC .750 

ACCACTTTCG CGCATTTGGA TGCTACTACT GTCTTGTCTC GTGGTATCTC 800 

AGAATTGGGT ATCTACCCAG CTGTCGATCC ATTGGATTCC AAATCAAGAT 850 

TATTAGATGC CGCTGTTGTT GGTCAAGAAC ATTACGATGT TGCTACTCAA 900 

GTTCAACAAA CTTTACAAGC CTACAAATCT TTACAAGATA TTATCGCTAT 950 

TTTAGGTATG GATGAATTAT CAGAACAAGA TAAATTGACC GTTGAAAGAG 1000 

CTAGAAAGAT CCAACGTTTC TTATCTCAAC CATTTGCTGT CGCTGAAGTT 1050 

TTCACTGGTA TTCCAGGTAG ATTAGTTAGA TTGAAAGAAA CCGTTGCTTC 1100 

ATTCAGAGAC GTTTTAGCTG GT 1122 



2) INFORMATION FOR SEQ ID NO: 1648 . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1648 
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TACGTCTCTT ACCCTGACGA TGATTTGCAA 
TGTTTCAAAT GGTAAAGTCA TCGCCCAACT 
GTAACGTTTC ATTTGGTACC AACCAAGCTG 
GGATCAACTA TGAAACCAAT CACAGACTAT 
5 TGTCTACGAT TCAACTGCTA CTATCGTTCA 
CTGGGACAGA TATCCCTCTC TATAACTGGG 
ATTACTCTGC AATATGCCCT TCAACAATCT 
AACACTAAAC AAGGTCGGTC TAGATAAGGC 
TTGGTATCGA CTATCCAAGC ATGCATTATG 

10 ACAACTGAAT CCAACAAAAA ATATGGTGCA 
CGCCTATGCC GCATTCGCAA ATGGTGGTAT 
TCAATAAAGT TGTCTTTAGC GATGGTAGCG 
GGCACAAGAG CCATGAAAGA AACGACTGCT 
GAAAACAGTC TGGACGTACG GAACTGGTCG 

15 TTCCTCAAGC TGGTAAAACA GGTACCTCTA 
GAAAAGTATA TCAAGAACAG TGGTTACGTA 
GGGTTATACC CGT 



GTCGCATCTA CGGTCGTAGA 50 

TGGAGCTCGT CACCAAGCAA 100 

TGGAAACCAA TCGTGACTGG 150 

GCTCCTGCCT TGGAGTACGG 200 

CGATGAGCCC TATAACTACC " 250 

ATCGAGCATA TTTCGGTAAT 300 

CGTAACGTAC CTGCCGTTGA 350 

TAAAACCTTC CTTAATGGTC 400 

CAAACGCCAT TTCAAGTAAT 450 

AGTAGTGAAA AAATAGCTAC 500 

TTACCACAAA CCAATGTACA 550 

AAAAAGAATT TTCTGACCCT 600 

TACATGATGA CAGAAATGAT 650 

TGGTGCCTAC CTGCCTTGGC 700 

ACTATACTGA CGAAGAAATT 750 

GCTCCAGATG AAATGTTTGT 800 

813 



20 



2) INFORMATION FOR SEQ ID NO: 1649 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 813 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1649 

ATTAATGAAG CTATTGTTGT TAATTATGAA TTAGAAGGAA AAGAATGCAA 50 

GCTAGTTCTT GAAGTAGCTG CACATTTAGG CGATAATAAA GTAAGAACCA 100 

TCGCTATGGA TATGACAGAT GGTCTTGTTA GAGGTTTAAC AGCCGTCGCA 150 

4 0 ACTGGAAATC CAATTAGTGT TCCAGTAGGC GAAAAAGTTC TTGGAAGAAT 200 

TTTTAATGTA ACGGGTGATT TGATTGATGA GGGCGAAGAA ATCAATTTTG 250 

ATAAGCACTG GTCAATTCAT AGAGATCCAC CTCCATTTGA AGAACAAAGT 300 

ACAAAAAGCG AAATCTTTGA AACAGGTATA AAGGTTGTTG ATTTGCTAGC 350 

TCCTTATGCT AAAGGTGGAA AAGTTGGTCT TTTTGGTGGT GCAGGTGTTG 4 00 

4 5 GTAAAACCGT TATTATTATG GAATTAATTC ACAATGTTGC ATTTAAACAT 4 50 

AGCGGATATT CTGTTTTTGC AGGTGTTGGC GAAAGAACTC GTGAGGGTAA 500 

TGACCTTTAC AATGAAATGA AAGAAAGTAA TGTATTAGAT AAAGTTGCAT 550 

TGTGTTATGG TCAAATGAAT GAACCACCAG GGGCAAGAAA TCGTATAGCT 600 

TTAACAGGTC TTACTATGGC TGAGTATTTT AGAGATGAAA TGGGACTTGA 650 

50 TGTTTTAATG ATTATTGATA ATATTTTCAG ATTTTCTCAA TCAGGCTCAG 700 

AAATGTCAGC ACTTTTAGGA AGAATTCCTT CAGCTGTTGG TTATCAACCA 750 

ACCTTAGCTA GTGAAATGGG TAAGTTCCAA GAAAGAATTA CTTCAACCAA 800 

GAAAGGATCT ATT 813 
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2) INFORMATION FOR SEQ ID NO: 1650 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 570 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:- Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1650 



GGTGGGGAAG ACTGTGTTCA TTCAGGAATT 
ACTAAAGCCG ACAGCGGTTG CTGATATGCT 
CTCACGGTGG TTACTCCGTG TTCACTGGTG 
GGTAACGATT TGTACCATGA AATGCAAGAG 
CGGAGAGTCC AAGGTCGCTC TTGTCTTCGG 
GTGCCCGTGC CCGTGTTGCC CTTACCGGTT 
CGTGACGAGG AAGGCCAAGA CGGTAGGCTT 
CGTGTGATAC AGGAGGCTAA TCGCTTTTCT 
ACATTTTCCG TTTCACTCAA GCTGGTTCTG 
CGTATTCCTT CCGCTGTCGG TTACCAACCT 
TGTTATGCAG GAACGTATCA CCACCACCAC 
TGCARGCCGT CTACGTCCCC 



GATTGTAAGT TCTGTTATCA 50 

CTAGAACAAC ATTGCTAAGG 100 

TCGGTGAGCG TACCCGTGAG 150 

ACCCGTGTCA TTCAACTCGA 200 

TCAAATGAAC GAGCCCCCTG 250 

TGACCATTGC TGAATACTTC 300 

CATGCTTCTA TCGCTAGGGG 350 

AGTGCTTCTC TTTATTGACA 4 00 

AAGTGTCTGC CTTGCTCGGT 4 50 

ACTCTCGCCG TCGACATGGG 500 

CAAGGGATCC ATTACTTCAG 550 

570 



2) INFORMATION FOR SEQ ID NO: 1651 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1651 

TGTCTTCATC CAGGAGTTAA TTGTACGTTT TTTGTCTGCC TACTATAATA 50 

GACGAAGAAA TATTTTGATT TATTTTTCTA ATAATTCAAA TAGAACAACA 100 

TTGCCAAAGC CCACGGTGGT TACTCTGTCT TCACTGGTGT TGGCGAGCGG 150 

ACCCGTGAAG GAAACGATCT GTACCACGAA ATGCAGGAGA CCCGTGTCAT 200 

CCAGCTCGAT GGCGAGTCCA AGGTCGCCCT CGTCTTCGGT CAAATGAACG 250 

AACCCCCCGG AGCCCGTGCC CGTGTTGCCC TCACTGGTCT GACCGTTGCT 300 
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GAATACTTCC GTGACGAGGA AGGTCAAGAT GGTGCGTATA TATATATTCG 
CCAGTAATTT GACTCGAAGC TCCACTCACA CATATATTAG TGCTCCTCTT 
CATCGACAAC ATTTTCCGCT TCACCCAGGC AGGTTCCGAA GTGTCCGCCC 
TGCTCGGCCG TATCCCCTCC GCCGTCGGTT ACCAGCCCAC CCTCGCTGTC 
GACATGGGTA TGATGCAGGA ACGTATCACC ACCACCACCA AGGGCTCCAT 
CACCTCCGTG 



350 
400 
450 
500 
550 
560 
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30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:. Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14 97 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1652 

CCACAAGTGT ACAGCGCCCT TGAGGTTAAA AATGGTGATG CTCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TGGGCGGTGG CGTGGTTCGT ACCATCGCCA 100 

TGGGTTCTTC AGACGGCCTT AAGCGTGGTC TGGAAGCCGT TGACCTTCAG 150 

CACCCAATTG AAGTACCGGT AGGTACTGCC ACACTTGGCC GTATCATGAA 200 

CGTGCTGGGT GAGCCGATCG ATATGAAAGG CGACATTGGC GAAGAAGAGC 250 

GCTGGGCGAT TCACCGCTCT GCACCTTCTT ACGAAGATCA GTCGAACTCT 300 

CAGGATCTGC TGGAAACCGG CATCAAGGTG ATTGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCTTGTTCGG TGGTGCGGGC GTAGGTAAAA 400 

CCGTCAACAT GATGGAGCTT ATTCGTAACA TTGCGGCTGA GCACTCAGGT 4 50 

TTCTCGGTAT TTGCCGGTGT GGGTGAGCGT ACCCGTGAAG GTAACGACTT 500 

CTACCACGAA ATGACCGACT CCAACGTTAT CGACAAAGTT TCGCTGGTCT 550 

ATGGTCAGAT GAACGAGCCA CCGGGTAACC GTCTGCGCGT TGCGCTGACC 600 

GGTCTGACCA TGGCGGAGAA GTTCCGTGAC GAAGGTCGTG ACGTACTGCT 650 

GTTTATCGAT AACATCTATC GTTACACCCT GGCCGGTACT GAAGTCTCCG 700 

CTCTGCTGGG TCGTATGCCT TCTGCGGTAG GTTACCAGCC AACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAACGTATC 780 



2) INFORMATION FOR SEQ ID NO: 1653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1653 

TGTGTTCATT CAGGAGCTGA TTGTGAGTAC CCCGGAGATT TTCCTGCGAT 50 

TGCGCATGAA GCAAGCGCTG ACGTCCATCT AGAACAACAT CGCCAAGGCT 100 

CACGGTGGTT ACTCCGTGTT CTGCGGTGTC GGCGAGCGTA CTCGTGAGGG 150 

TAACGATTTG TACCACGAAA TGCAGGAGAC CGGTGTCATC AACCTCGAGG 200 

GCGAGTCCAA GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCGGGA 250 

GCCCGTGCCC GTGTCGCCCT TACTGGTCTT ACCGTCGCTG AGTAAGTTTT 300 

GACAACCAGA AGCGAGTATT GCCACAATTA CTGACTAAAA ATCAAGATAT 350 

TTCCGTGACG AGGAGGGCCA GGATGTGCTT CTCTTCATTG ACAACATTTT 400 

CCGTTTCACC CAGGCCGGTT CTGAGGTGTC CGCTCTTCTC GGCCGTATTC 4 50 

CCTCTGCCGT CGGTTACCAG CCCACTCTCG CCGTCGACAT GGGTATGATG 500 

CAGGAGCGTA TCACCACCAC CCAGAAGGGT TCCATCACTT CCGTC 54 5 



2) INFORMATION FOR SEQ ID NO: 1654 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium jn< 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ 

TGTGTTTATT CAGGAGCTCA TCAACAACAT 
ACTCTGTCTT CACTGGTGTC GGTGAGCGAA 
TACCACGAAA TGCAGGAGAC TTCCGTTATT 
GGTTGCCCTG GTTTTCGGTC AGATGAACGA 
GTGTCGCTCT TACCGGGTAA GTTGATAGAT 
CAACACCTCA ACACCACCAC TCCCCAAGAC 
ATGATATTAT ATTTACGCCT CTTGGACGCT 
TTTGACTGTT GCTGAGTACT TCAGAGACGA 
TTTTCATTGA CAACATTTTC CGATTCACTC 
GCCCTTCTCG GTCGTATCCC CTCTGCCGTC 
CGTCGACATG GGTGGTATGC AGGAGCGTAT 
CCATTACCTC AGTC 



Hi forme 



ID NO: 1654 

CGCCAAGGCC CACGGTGGTT 50 

CCCGTGAGGG TAACGATCTG 100 

CAGCTTGATG GCGAGTCCAA 150 

GCCCCCTGGA GCTCGTGCCC 200 

AGTGCCTTCC CTTCCTGTTT 250 

ATTGCTACTC ATACTGCACC 300 

AGCTAATGTT GTATCGACAG 350 

GGAGGGTCAG GACGTGCTGC 400 

AGGCCGGTTC CGAGGTGTCT 450 

GGTTACCAGC CCACTCTGGC 500 

TACCACCACC ACCAAGGGTT 550 

564 



2) INFORMATION FOR SEQ ID NO: 1655 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ 

CGTACCGCGC GTGTACGAGG CTCTTGAGGT 
TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG 
GCCATGGGTT CTTCCGACGG TCTGCGTCGC 
CGAGCATCCG ATCGAAGTCC CGGTAGGTAA 
TGAACGTACT GGGCCAACCG GTAGACATGA 
GAGCGTTGGG CGATTCACCG CGCAGCGCCT 
CTCTCAGGAA CTGCTGGAAA CCGGCATCAA 
CGTTTGCGAA GGGCGGTAAA GTTGGTCTGT 
AAAACCGTAA ACATGATGGA GCTGATCCGT 
CGGTTACTCC GTGTTTGCGG GCGTAGGTGA 
ACTTCTACCA CGAAATGACC GACTCCAACG 
GTGTATGGCC AGATGAACGA GCCGCCGGGA 
GACCGGCCTG ACCATGGCTG AGAAGTTCCG 
TGCTGTTCGT CGATAACATC TATCGTTACA 
TCCGCACTGC TGGGTCGTAT GCCTTCAGCG 
GGCGGAAGAG ATGGGCGTTC TGCAGG 



oxytoca 



ID NO: 1655 

ACAAAATGGT AGTGAGAATC 50 

GCGGTATTGT TCGTACCATC 100 

GGTCTGGAAG TCAAAGACCT 150 

AGCAACGCTG GGTCGTATCA 200 

AAGGCGACAT CGGCGAAGAA 250 

TCCTACGAAG AGTTGTCAAA 300 

AGTTATCGAC CTGATGTGTC 350 

TCGGTGGTGC GGGTGTAGGT 400 

AACATCGCGA TCGAGCACTC 4 50 

ACGTACTCGT GAGGGTAACG 500 

TTATCGATAA AGTATCCCTG 550 

AACCGTCTGC GCGTTGCGCT 600 

TGACGAAGGT CGTGACGTTC 650 

CCCTGGCCGG TACTGAAGTA 7 00 

GTAGGTTACC AGCCGACTCT 750 

776 



2) INFORMATION FOR SEQ ID NO: 1656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1656 

GTGTTCATCC AGGAGTTGAT TGTAAGTGAT TATATTCCCC TAGAAAGAAA 50 
TGTTTTGAAC AAAAGTCTCG AATTAGAAAA TTCTTTTCAG ATACTAATTT 100 
ACTATAGAAC AACATTGCCA AGGCTCACGG TGGTTACTCC GTCTTCACCG 150 
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GTGTCGGAGA GCGAACCCGT 
GAAACTCGTG TCATCCAACT 
CGGTCAGATG AACGAGCCCC 
GTTTGACCAT TGCTGAGTAC 
TTCTTTAAAT TAGATATCTT 
AGTGCTTCTC TTCATCGACA 
AAGTGTCTGC CCTGCTTGGT 
ACTCTTGCCG TCGACATGGG 
GAAGGGATCC ATTACCTCCG 



GAAGGAAACG ATCTGTACCA TGAAATGCAG 
TGATGGCGAG TCCAAGGTCG CCCTGGTCTT 
CAGGTGCCCG TGCCCGTGTT GCTCTTACTG 
TTCCGTGATG AGGAAGGTCA AGACGGTATG 
CTGGAGAAAC AGCGTCTAAC AAATTCTTCC 
ACATCTTCCG TTTCACTCAG GCTGGTTCCG 
CGTATTCCAT CTGCCGTCGG TTACCAACCC 
TGGTATGCAG GAACGTATTA CCACCACCAA 
TC 



200 
250 
300 
350 
400 
450 
500 
550 
572 



2) INFORMATION FOR SEQ ID NO: 1657 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1657 

GCCTAAAGTG TATAACGCAC TTGAGGTGAA AGGCGGTGCC ACTAAACTGG 50 

TACTGGAAGT TCAGCAGCAG CTAGGCGGCG GCGTTGTACG CTGTATCGCT 100 

ATGGGTACTT CTGACGGTCT GCGTCGCGGA CTGGACGTTG TTGACCTGGA 150 

GCACCCGATT GAAGTCCCAG TAGGTAAAGC GACCTTAGGC CGCATTATGA 200 

ACGTACTGGG TGAGCCAATT GATATGAAGG GTGATATCGG CGAAGAAGAT 250 

CGCTGGGCTA TTCACCGTGA AGCTCCAAGC TACGAAGAAC TGTCTAACTC 300 

GCAAGAACTG CTGGAAACCG GTATCAAGGT AATGGACTTG ATTTGTCCGT 350 

TCGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTTGGTAAA 4 00 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCAGG 4 50 

TTACTCTGTA TTTGCCGGCG TGGGTGAACG TACTCGTGAG GGTAACGACT 500 

TCTACCACGA AATGACCGAC TCCAACGTAT TGGACAAAGT ATCACTGGTT 550 

TATGGCCAGA TGAACGAGCC ACCAGGAAAC CGTCTGCGCG TTGCGCTGAC 600 

CGGTCTGACT ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT GACGTACTGC 650 

TGTTCATCGA TAACATCTAC CGTTATACCT TGGCCGGTAC CGAAGTATCT 700 

GCACTGCTGG GTCGTATGCC TTCTGCGGTA GGTTATCAGC CAACGCTGGC 750 

GGAAGAGATG GGTGTTCTGC AAGAACGTAT CACCTCTACC 7 90 



2) INFORMATION FOR SEQ ID NO: 1658 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1658 

TGTCTTCATT CAGGAGCTTA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACTGGTGTG GGAGAGCGCA CTCGTGAGGG AAACGACTTG 100 

TATCACGAGA TGCAAGAGAC TTCCGTTATC CAGCTCGAAG GCGAATCCAA 150 

GGTTGCCCTC GTCTTCGGTC AAATGAACGA GCCTCCGGGT GCTCGTGCTC 200 

GTGTTGCTCT CACCGGTCGT AAGTGCTCCT TCCCAGATTT CTCTTCCCCA 250 

GTTTCTGGAC CCACTTTTTO CTTCCACCAC CATTCTACTG GGTAGGACCA 300 

AGATAGCACT GCCTATTCTG GTGCCTTCCT ACCGCCTACT CTACTGCCTA 350 

TTCCACCACC TTTTCTACCG CCTCTTCTAC TTGCTATTGT ATACTAACTT 400 

ACTCAAACAG TTACTATTGC TGAGTACTTC CGTGACGCTG AGGGCCAGGA 4 50 

TGTGCTTCTC TTCATCGACA ACATTTTCCG TTTCACCCAG GCCGGTTCCG 500 

AGGTGTCCGC TCTTCTCGGT CGTATCCCCT CCGCCGTCGG TTACCAGCCC 550 

ACCCTTGCCG TCGACATGGG TGGTATGCAG GAGCGTATCA CCACCACCAA 600 

GAAGGGATCC ATTACCTCCG TC 622 



2) INFORMATION FOR SEQ ID NO: 1659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1659 

GACGCTGTAC CTCAGGTGTA CGATGCACTG ACAGTTGAGG GTGCTGAGCT 50 

GGTAC.TGGAA GTGCAGCAGC AGCTGGGTGG TGGTGTTGTT CGCTGTATCG 100 

CGATGGGTGC CTCTGATGGC CTCAAGCGCG GTCTGAAAGC GCACAATACT 150 

GGTGCTCCTA TCACTGTACC GGTGGGTGTG GAAACACTGG GCCGGATCAT 200 

GGATGTGTTG GGTAACCCGA TTGACCAGAA AGGTCCAATC GGTGAACAAG 250 

ATCGCTGGGT GATCCACCGT GAAGCACCAA GCTACGAAGA TCAGGCTAAC 300 

AGCACTGAAC TGCTGGAAAC CGGTATCAAG GTTATCGACC TGGTATGCCC 350 

GTTTGCGAAA GGCGGTAAAG TCGGTCTGTT CGGTGGTGCC GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTATTCCG TGTTTGCGGG CGTGGGTGAG CGTACCCGTG AAGGTAACGA 500 

CTTCTACCAC GAAATGACAG ACTCCAACGT ACTGGACAAA GTATCCCTGG 550 
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TGTACGGTCA GATGAACGAG CCGCCAGGTA ACCGTCTGCG CGTAGCACTG 
ACCGGCCTGA CCATTGCGGA GAAATTCCGT GATGAAGGTC GTGACGTACT 
GCTGTTCATC GATAACATCT ACCGTTATAC CCTGGCGGGG ACCGAAGTAT 
CGGCACTGCT GGGCCGTATG CCTTCTGCGG TAGGTTATCA GCCAACGCTG 
GCGGAAGAGA TGGGTGTACT GCAAGAGCGT ATTACCTCTA CCCG 
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2) INFORMATION FOR SEQ ID NO: 1660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1660 

AGGTATATGA CGCTSTGAAG ATCACAGGTG AAGGCGCCTG TAATGGTTTG 50 

GTGCTGGAAG TTCAGCAACA GCTAGGCGGT GGTGTAGTTC GTACTATCGC 100 

TATGGGTTCT TCTGATGGTC TGCGTCGTGG TCTTGAGGTT GTTAACTCAG 150 

GTTCACCTAT TTCTGTTCCT GTTGGTACCG CCACGCTTGG CCGTATCATG 200 

AACGTATTAG GTGAGCCTAT TGATGAAGCG GGTCCAATCG GTGAAGAAGA 250 

GCGTTATGTT ATTCACCGTG CAGCACCTTC ATATGAAGAT CAATCGAACA 300 

CTACTGAACT GTTAGAGACA GGTATCAAGG TTATTGACCT TGTTTGTCCA 350 

TTCGCTAAGG GTGGTAAAGT AGGTCTGTTC GGTGGTGCGG GTGTTGGTAA 4 00 

AACAGTTAAC ATGATGGAAC TGATTAACAA CATCGCTAAA GCTCACTCGG 450 

GTCTTTCGGT GTTCGCCGGT GTGGGTGAAC GTACTCGTGA AGGTAACGAC 500 

TTCTACTACG AGATGAAAGA TTCTGGCGTT CTCGACAAAG TGGCCATGGT 550 

TTATGGTCAG ATGAACGAGC CACCAGGAAA CCGTTTACGC GTAGCACTGT 600 

CAGGTCTGAC AATGGCTGAG AAGTTCCGTG ACGAAGGTCG TGACGTATTG 650 

TTGTTCGTTG ACAACATCTA CCGTTATACC TTAGCCGGTA CTGAAGTATC 700 

TGCACTGTTA GGCCGTATGC CTTCTGCGGT AGGTTATCAA CCAACATTGG 750 

CTGAAGAAAT GGGCGTTCTG CAAGAGCGTA TTACTTCAAC TAAGACGGG 799 



2) INFORMATION FOR SEQ ID NO: 1661 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



858 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1661 

TGGGAAGCGA AAATCCTG 18 

5 

2) INFORMATION FOR SEQ ID NO: 1662 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 774 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear' 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1662 



20 



CTATGCCTCA AACTAGAGAG CATATCTTGC TATCTCGCCA AGTAGGCGTT 50 

CCATATATCG TTGTATTTAT GAACAAAGCC GATATGGTCG ATGACGCTGA 100 

25. GCTTCTTGAG CTAGTCGAGA TGGAAATTCG CGAGCTTCTT AACGAGTACA 150 

ACTTCCCTGG CGATGATACT CCTATCATAT CAGGTTCTGC TCTTAAAGCC 200 

CTCGAAGAGG CTAAAGCAGG CGTTGATGGC GAGTGGTCAG CAAAAGTTCT 250 

TGAGCTTATG GATAAAGTCG ATGAGTATAT CCCAACTCCA GTTCGTGCTA 300 

CCGATAAAGA CTTCCTGATG CCTATCGAAG ACGTTTTCTC TATCTCAGGT 350 

B0 CGTGGAACGG TCGTTACTGG ' TAGGATCGAA AAAGGTGTCG TAAAAGTTGG 400 

CGATACTATC GAGATCGTTG GTATCAAACC TACTCAAACT ACGACAGTTA 450 

CTGGCGTTGA GATGTTTAGG AAAGAGATGG AACAAGGCGA GGCCGGTGAT 500 

AACGTAGGTG TTCTTTTAAG AGGTACTAAA AAAGAAGACG TCGAGCGCGG 550 

CATGGTTCTT TGTAAGCCAA AATCAATCAC TCCTCATACA AAATTTGAGG 600 

J 5 GTGAGGTTTA TATCCTAACA AAAGAGGAAG GCGGACGCCA CACTCCATTC 650 

TTTAACAACT ATAGACCACA ATTTTATGTA AGAACAACAG ACGTTACAGG 700 

TTCTATCACA CTTCCAGAAG GAACTGAGAT GGTTATGCCT GGAGATAATG 750 

TCAGAATTTC CGTTGAACTC ATCG 774 



10 



2) INFORMATION FOR SEQ ID NO: 1663 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter rectus 

859 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTCTGCGGCT GACGGCCCAA TGCCACAAAC 
CTCGCCAAGT AGGCGTTCCG TATATCGTTG 
ATGGTCGATG ATGCCGAGCT TCTTGAGCTG 
GCTTCTAAAC GAGTATGATT TCCCTGGTGA 
GCTCTGCTCT TCAAGCTCTT AATGAAGCCA 
TGGTCTGCAA AAATTCTTGA GCTTATGGCT 
GACTCCGGTT CGTGCAACGG ATAAAGACTT 
TTTTCTCTAT CTCCGGTCGC GGCACCGTCG 
GGTATCGTAA AAGTCGGTGA TACTATCGAG, 
TCAAACAACT ACCGTTACCG GCGTTGAGAT 
AAGGCGAAGC GGGCGATAAC GTAGGCGTTC 
GAAGACGTTG AGCGCGGTAT GGTTCTTTGC 
TCACACTAAA TTTGAGGGAG AGGTTTATAT 
GACGCCATAC TCCATTCTTT AATAACTATA 
ACTACCGACG TTACCGGTTC TATCACTCTT 
TATGCCTGGC GATAACTTAA AGATAAGCGT 



ID NO: 1663 



TAGAGAGCAC 


ATCTTGCTAT 


50 


TTTTTATGAA 


CAAAGCCGAT 


100 


GTTGAGATGG 


AGATTCGCGA 


150 


CGATACTCCA 


ATCGTAGCAG 


200 


AAGCCGGAAC 


AGAAGGCGAG 


250 


AAAGTTGACG 


AGTATATCCC 


300 


CTTGATGCCT 


ATTGAGGACG 


350 


TTACCGGCAG 


AATCGAAAAA 


400 


ATCGTAGGTA 


TCCGCGATAC 


450 


GTTCAGAAAA 


GAGATGGATC 


500 


TTCTAAGAGG 


CACTAAAAAA 


550 


AAACCTAAAT 


CAATCACTCC 


600 


CTTAACTAAA 


GAGGAAGGCG 


650 


GACCGCAGTT 


TTATGTAAGA 


700 


CCGGAAGGAA 


CAGAGATGGT 


750 


TGAGCTTATC 


G 


791 



2) INFORMATION FOR SEQ ID NO: 1664 

(i) SEQUENCE CHARACTERISTICS: 
' (A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1664 

CGACGGACAG ATGCCCCAGA CCAGGGAGCA CTTGCTCCTC GCCCGCCAGG 50 

TCGGTGTCAA GCGCATTGTC GTCTTCGTCA ACAAGGTCGA TGCCATTGAG 100 

GACAAGGAGA TGTTGGAGCT CGTCGAGATG GAGATGCGTG AGCTTCTCTC 150 

CAGCTACGGC TTCGAGGGTG ACGACACTCC CATCGTCATG GGTTCCGCCC 200 

TTTGCGCCAT TGAGGGCCGC GAGCCCGACA TTGGTGTCGA GAAGATTGAC 250 

GAGCTCCTCG AGCACGTCGA CACCTGGATC CCCACCCCCG AGCGTGACAT 300 

CGCCAAGCCT TTCCTCATGT CCGTTGAGGA CGTCTTCTCC ATTCCCGGCC 350 

GTGGTACCGT CGCTTCTGGC CGTGTCGAGC GTGGTGTCCT GAAGAAGGAT 400 

TCCGAAGTCG AGCTTGTCGG CAAGAACAAG AACCCCATCA AGACCAAGGT 450 

TACCGACATC GAGACCTTCA AGAAGTCTTG CGACGAGTCC CGCGCTGGTG 500 

ACAACTCCGG TCTCCTTCTC CGTGGTGTCA AGCGTGACGA TGTCCTCCGT 550 

GGCATGGTCG TTGTCCAGCC CGGCACCACC AAGGCCCACA AGAAGTTCCT 600 

TGCCTCCATG TACGTCCTCA CCAAGGAGGA GGGTGGCCGC CACACTGGTT 650 

TCGCCAACAA CTACAAGCCC CAGATGTTCA TCCGTACCGC CGATGAGGCC 700 

860 

SUBSTITUTE SHEET (RULE 26) 
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GCCACTCTTA CCTGGCCCGA GGGTACCGAG GAGGACAAGA TGGTCATGCC 750 
CGGTGACAAT GTCGAGATGA TCTGCGAGAT CCACAAGCCC ATTGCCGTCG 800 
AGCAAGGCCA 810 



2) INFORMATION FOR SEQ ID NO: 1665 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1665 
CAGTACAGGT AGACTTCTG 19 

20 



2) INFORMATION FOR SEQ ID NO: 1666 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

35 (B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1666 

ATGATTGCGA AACCTACTTG CTGTGGAAGA ATTTGGATAT TCTAACATTT 50 

40 CTCTAGGCCT CAAACCAGAG AGCATCTGCT CCTTGCCCGC CAGGTCGGTG 100 

TTCAGAAGCT CGTCGTTTTC GTTAACAAGG TCGACGCTGT TGAGGACCCA 150 

GAGATGTTGG AACTTGTCGA GCTAGAGATG CGTGAGCTGC TCAGCCACTA 200 

TGGTTTCGAG GGTGAGGAGA CCCCAATCAT TTTTGGCTCT GCTCTCTGCG 250 

CCCTTGAATC TCGACGACCA GAATTGGGTG TTGAGAAGAT CGATGAGCTA 300 

45 TTGAACGCTG TGGATACCTG GATTCCCACC CCAGAGCGTG CCACTGATAA 350 

GCCTTTCCTT ATGTCCATTG AGGAAGTTTT CTCCATCTCT GGTCGTGGTA 400 

CCGTCGTTTC CGGTCGTGTC GAGCGTGGTA TCCTCAAGAA GGACTCTGAT 450 

GTCGAAATTG TGGGTGGATC TGATACACCC ATCAAGACGA AGGTCACCGA 500 

CATTGAAACC TTCAAGAAGT CTTGTGACGA ATCCCGAGCT GGTGACAACT 550 

50 CCGGTCTACT TCTCCGAGGT GTCAAGCGTG AGGACTTGAG ACGTGGAATG 600 

GTTGTTGCTG CTCCCGGATC GACCAAGGCT CATACCGACT TCATGGTCTC 650 

CCTTTATGTT CTGACCGAGG CTGAGGGTGG CCGTTCCAAT GGATTCACCC 700 

ACAAGTACCG CCCACAGATG TTCATCCGTA CTGCCGGTAT GTAAACCCTT 750 

861 
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TTTCTACCAT TCACTTTGTT TCACCACTGA CTTGTATACT TTACCGCAGA 
CGAAGCCGCA TCTTTCAGCT GGCCTGGAGA GGATCAAGAC AAGAAGGCCA 
TGCCTGGTGA CAATGTCGAG ATGATTTGCA AGACCCTC 
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2) INFORMATION FOR SEQ ID NO: 1667 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1667 

ATGCCGCAGA CCCGCGAGCA CTTGCTGCTC GCCCGTCAGG TCGGTGTCCA 50 

GAAGATCGTT GTTTTCGTCA ACAAGGTTGA TGCTATCGAC GACCCGGAGA 100 

TGCTGGAGCT TGTCGAGATG GAGATGCGTG AACTTCTCAG CACATACGGT 150 

TTCGAGGGTG ACGAGACCCC TGTTATTATG GGCTCCGCGC TCATGGCTCT 200 

CAACAACCAG CGCCCCGAGA TTGGTCAACA GAAGATTGAT GAACTCATGG 250 

CCGCTGTCGA CGAGTGGATC CCTACTCCCC AGCGTGACCT CGACAAGCCT 300 

TTCCTGATGT CTGTTGAGGA TGTCTTCTCC ATTGCTGGCC GTGGTACCGT 350 

TGTGTCCGGC CGTGTGGAGC GCGGTACCCT CAAGCGTGAT GAGGAAGTCG 4 00 

AGCTTGTCGG CAAGGGTGTC GACCCCATCA AGACCAAGGT CACCGATATC 4 50 

GAGACTTTCA AGAAGTCCTG CGAGGAGGCT CAGGCTGGTG ACAACTCTGG 500 

TCTTCTGATC CGTGGTGTCC GCCGCGAGGA TGTTCGTCGC GGTATGGTTG 550 

TCTCCAAGCC CGGCACCGTC AAGTCTCACA CTCAGTTCCT GGCCTCGCTT 600 

TACGTTCTCA CCAAGGAGGA GGGTGGTCGC CACACTGGTT TCGGCGAGCA 650 

CTACCGTCCC CAGCTCTACC TCCGTACCTC AGACGAGTCT GTCGATCTGA 700 

CCTTCCCCGA GGGAACTGAG GATCACCACT CCAAGATCGT CATGCCTGGT 7 50 

GACAACATCG AGATGGTCGT CACGATGACT CACGCCAACG CTA 7 93 



2) INFORMATION FOR SEQ ID NO: 1668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
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(B) STRAIN: K-12 KL1699 

(C) ACCESSION NUMBER: J01717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1668 

5 

AACATGATCA CCGGTGCTGC GCAGATGGAC GGCGCGATCC TGGTAGTTGC 50 

TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC CTGCTGGGTC 100 

GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA ATGCGACATG 150 

GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG TTCGTGAACT 200 

10 TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC GTTCGTGGTT 250 

CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC GAAAATCCTG 300 

GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG AGCGTGCGAT 350 

TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC ATCTCCGGTC 400 

GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT CAAAGTTGGT 4 50 

15 GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT CTACCTGTAC 500 

TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT GCTGGTGAGA 550 

ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT CGAACGTGGT 600 

CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA AGTTCGAATC 650 

TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT ACTCCGTTCT 700 

20 TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA CGTGACTGGT 750 

ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG GCGACAACAT 800 

CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC GACGGTCTGC 850 

GTTTCGCAAT CCGTGAAGGC GGCCGTACCG TTGGCGCGGG C 8 91 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1669 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

(B) STRAIN: ATCC 60625 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1669 



TCCTCGTGGT CGCCGCCACC GACGGCCCGA TGCCGCAGAC CAAGGAGCAC 50 

GTGCTCCTGG CCCGCCAGGT CGGCGTTCCG TACATCGTCG TCGCCCTCAA 100 

45 CAAGGCCGAC ATGGTGGACG ACGAGGAGAT CCTGGAGCTC GTCGAGCTCG 150 

AGGTCCGTGA GCTCCTCTCC GAGTACGAGT TCCCGGGCGA CGACGTTCCC 200 

GTCGTCAAGG TCTCCGCTCT GAAGGCCCTC GAGGGCGACA AGGAGTGGGG 250 

CAACTCGGTT CTCGAGCTCA TGAACGCCGT CGACACGGCG ATCCCCGAGC 300 

CCGAGCGTGA CGTCGACAAG CCGTTCCTCA TGCCGATCGA GGACGTCTTC 350 

50 ACGATCACCG GTCGCGGTAC GGTCGTCACC GGCCGTATCG AGCGTGGTGT 400 

CCTGAAGGTC AACGAGACCG TCGACATCAT CGGCATCAAG ACCGAGAAGA 450 

CCACCACCAC GGTCACCGGC ATCGAGATGT TCCGGAAGCT CCTCGACGAG 500 

GGCCAGGCCG GTGAGAACGT CGGTCTGCTC CTCCGTGGCA TCAAGCGCGA 550 

863 
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GGACGTCGAG CGCGGCCAGG TCATCATCAA 

ACACGGAGTT CGAGGCGCAG GCCTACATCC 

CGCCACACGC CGTTCTTCAA CAACTACCGC 

GACGGACGTG ACCGGCGTGG TGACCCTCCC 

5 TGCCGGGTGA CAACACCGAG ATGAAGGTGG 
ATGGA 



PCT/CA00/01150 

GCCGGGCTCG GTCACGCCGC 600 
TGTCCAAGGA CGAGGGTGGC 650 
CCGCAGTTCT ACTTCCGTAC 700 
CGAGGGCACC GAGATGGTCA 750 
AGCTCATCCA GCCCGTCGCC 800 

805 



10 2) INFORMATION FOR SEQ ID NO: 1670 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 935 bases 

(B) TYPE: * Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1670 



GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACA GATGAAAGGA 50 

TTTGACGTTT CTAACATCAG TCTAGGCCTC AGACCAGAGA ACATTTGCTC 100 

CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG TTAACAAGGT 150 

CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA CTTGAAATGC 200 

30 GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC CCCCATCATT 250 

TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG AGCTTGGTGT 300 

CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG ATCCCCACCC 350 

CAGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA GGAAGTGTTC 400 

TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG AGCGTGGTAT 4 50 

35 CCTCAAGAAG GATTCGGACG TCGAAATTGT TGGTGGCTCT ACCACCCCTA 500 

TCAAGACCAA GGTCACCGAT ATCGAAACCT TCAAGAAGTC CTGCGATGAA 550 

TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA TCAAGCGTGA 600 

GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC ACCAAGGCTC 650 

ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC TGAGGGTGGT 700 

4 0 CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT TCATCCGTAC 750 

TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG ATCATTGCTA 800 

ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC TGGCCTGGAG 850 

AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA GATGATTTGC 900 

AAGACCCTCC ACCCCATTGC TGCCGAGGCT GGCCA 935 

45 



2) INFORMATION FOR SEQ ID NO: 1671 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

864 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter aerogenes 

(B) STRAIN: ATCC 13048 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1671 



ACGATGCCCT TGAGGTACAG AATGGTAATG AGAGCCTGGT GCTGGAAGTT 50 

CAGCAGCAGC TCGGCGGTGG CGTAGTCCGT GCTATCGCCA TGGGTTCTTC 100 

CGACGGTCTG CGTCGTGGTC TGGAAGTTAA AGACCTTGAG CACCCGATCG 150 

AAGTCCCGGT AGGTAAAGCG ACTCTGGGCC GTATCATGAA CGTCCTGGGT 200 

15 CAGCCGATCG ACATGAAAGG CGACATCGGC GAAGAAGAAC GTTGGGCTAT 250 

CCACCGCGCG GCGCCTTCCT ATGAAGAGCT GTCCAGCTCT CAGGAACTGC 300 

TGGAAACCGG CATCAAAGTT ATCGACTTGA TGTGTCCGTT CGCTAAGGGC 350 

GGTAAAGTTG GTCTGTTCGG TGGTGCGGGT GTAGGTAAAA CCGTAAACAT 4 00 

GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT TACTCCGTGT 4 50 

20 TTGCGGGCGT TGGTGAGCGT ACTCGTGAGG GTAACGACTT CTATCACGAA 500 

ATGACCGACT CCAACGTTCT GGATAAAGTA TCCCTGGTTT ACGGCCAGAT 550 

GAACGAGCCG CCGGGAAACC GTCTGCGCGT TGCGCTGACC GGCCTGACCA 600 

TGGCTGAGAA ATTCCGTGAC GAAGGTCGTG ACGTTCTGCT GTTCGTCGAT 650 

AACATCTATC GTTACACCCT GGCCGGTACT GAAGTATCTG CACTGCTGGG 700 

25 CCGTATGCCT TCAGCGGTAG GTTATCAGCC GACTCTGGCG GAAGAGATGG 750 

GCGTTCTGCA GGAACGTATC AC 7 72 



30 2) INFORMATION FOR SEQ ID NO: 1672 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: Tohama 1 

(C) ACCESSION NUMBER: Genome project 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1672 

ATGAGCAACG GAACCATCGT TCAGTGCATC GGCGCCGTGG TGGATATTCA 50 

GTTCCCCCGC GATAACATGC CCAAGATCTA CGAAGCGCTC ACCCTGGTCG 100 

ACGAGGGTTC CTCGTTCGCC GAGAAGGGCT TGACGCTGGA AGTGCAACAA 150 

50 CAGCTGGGCG ACGGCGTGGT GCGTACCATC GCGCTGGGTT CCAGCGACGG 200 

GCTGCGCCGC GGCATGCAAG TGGCCGGCAC CGGCGCACCG ATCTCGGTGC 250 

CCGTGGGCCA CGGCACCCTG GGCCGCATCA TGGACGTGCT GGGCCGTCCC 300 

ATCGACGAAG CCGGTCCCAT CGCCTCCGAC GAGAAGCGCG CCATCCACCA 350 

865 
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GCCCGCGCCC CGTTTCGACG AGCTGTCGCC 
CCGGCATCAA GGTTATCGAC CTGGTGTGCC 
GTCGGCCTGT TCGGCGGCGC CGGCGTGGGC 
ACTGATCAAC AACATCGCCA AGCAGCACAG 
GCGTGGGCGA GCGTACCCGC GAAGGCAACG 
GAGTCGAACG TTCTGGACAA GGTGGCCATG 
GCCCCCGGGC AACCGCCTGC . GCGTGGCGCT 
AGAAGTTCCG CGACGAAGGC CGTGACATCC 
TACCGCTACA CCCTGGCCGG TACCGAAGTG 
GCCGTCGGCG GTGGGCTACC AGCCTACGCT 
TGCAAGAGCG CATCACCTCG ACCAAGACCG 
GCCGTGTACG TGCCTGCCGA CGACTTGACC 
CTTCCAGCAC TTGGACTCGA CCGTCGTGCT 
TGGGCATCTA TCCCGCCGTG GACCCGCTGG 
GACCCGCAAG TCGTGGGCGA AGAGCACTAC 
GCAGACGCTG CAGCGCTACA AGGAACTGCG 
GCATGGACGA ACTGTCGCCG GAAGACAAGC 
AAGATCCAGC GCTTCCTGTC GCAGCCCTTC 
CGGCTCGCCG GGCAAGTACG TGTCGCTGGC 
AGATGATCGT CGACGGCGAG TGCGACGCGC 
ATGGTCGGCA CGATCGACGA GGCCTTCGAG 
A 



PCT/CA00/01150 

GTCGGTCGAG CTGCTGGAAA 4 00 

CGTTCGCCAA GGGCGGCAAG 450 

AAGACCGTCA ACATGATGGA 500 

CGGCTTGTCG GTGTTCGCCG 550 

ACTTCTACCA CGAAATGGAA 600 

GTGTTCGGCC AGATGAACGA 650 

GACCGGCCTG ACCATGGCCG 700 

TGTTCTTCGT CGACAACATC 750 

TCGGCGCTGC TGGGCCGTAT 800 

GGCCGAGGAA ATGGGCGTGC 850 

GTTCGATCAC CTCGATCCAG 900 

GACCCGTCGC CCGCCACGAC 950 

GTCGCGTGAC ATCGCTGCGC 1000 

ATTCCTCCAG CCGCCAGCTC 1050 

CAGGTGGCCC GTGGCGTGCA 1100 

CGACATCATC GCGATTCTGG 1150 

AGGCCGTGGC CCGCGCGCGC 1200 

TACGTGGCCG AAGTGTTCAC 12 50 

CGAAACGATC CGTGGCTTCA 1300 

TGCCCGAGCA GGCGTTCTAC 1350 

AAGGCCAAGA AACTCCAATA 1400 

1401 



2) INFORMATION FOR SEQ ID NO: 1673 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 

(B) STRAIN: ATCC 9345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1673 

CAGCTACCGA CGGTCCAATG GCTCAGACCC GCGAGCACGT TCTTCTTGCT 50 

CGCCAGGTTG GCGTTCCACA GATCATCGTT GCTCTCAACA AGGCTGACAT 100 

GGTTGACGAC GAGGAAATCC TCGAACTCGT CGAAATGGAA GTTCGTGAGC 150 

TTCTCTCTTC CCAGGAGTAC CCAGGTGACG ACCTCCCAGT CGTCAAGATC 200 

TCGGCACTCA AGGCTCTCGA AGGCGATGCC GAATGGAGCA AGGCAATCGA 250 

AGATCTCATG GAAGCTGTCG ATACCTACTT CGACGATCCA GTGCGTGACC 3 00 

TCGATAAGCC ATTCCTCATG CCAATCGAAG ACGTCTTCAC CATCACCGGT 350 

CGTGGCACCG TTGTTACCGG CCGTGCAGAG CGCGGTATGC TCAACTTGAA 4 00 

CGAAGAAGTT GAAATCCTCG GTATCCGTGC ACCACAGAAG ACAACCGTTA 4 50 

CCGGTATCGA AATGTTCCAC AAGTCGATGG ATCACGCAGA TGCAGGCGAA 500 

AACTGTGGTC TTCTCCTCCG TGGCACCAAG CGCGAAGATG TTGAACGTGG 550 

TCAGGTTGTT GCCAAGCCAG GCACCATCAC CCCACACACC AACTTCGAAG 600 
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CTCAGGTCTA CGTGCTCGGT AAGGAAGAAG GTGGCCGTCA CAACCCATTC 
TTCTCCAACT ACCGTCCACA GTTCTACTTC CGTACCACGG ATGTTACCGG 
CGTGATCACC CTTCCAGAGG GCACCGAAAT GGTTATGCCA GGCGACAACA 
CCGACATGAC AGTTGAGCTC ATCCAGCCAA TCGCTATGGA AGAGGGC 
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2) INFORMATION FOR SEQ ID NO: 1674 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:. Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Butyrivibrio fibrisolvens 

(B) STRAIN: ATCC 19171 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1674 

CTGATGGTCC TATGCCACAG ACCCGTGAGC ACATCCTATT AGCACGTCAG 50 

GTAGGCGTAC CATACATCAT CGTATTCCTA AACAAGTGCG ATATGGTTGA 100 

CGACGAGGAA TTATTAGAGT TAGTTGAGAT GGACGTACGT GATCTATTAA 150 

ATCAGTACCA GTTCCCAGGC GACGACACTC CAATCATCCG TGGTTCAGCA 200 

CTAGGTGCAT TAAACGGCGA AGAGAAGTGG AAAGAGGCAA TCTATCAGTT 2 50 

AGCAGACACT CTAGATTCAT ACATTCCAGA GCCAAAGCGT GATATCGATG 300 

ATCCATTCCT ATTACCAATC GAAGATATCT TCTCAATCTC AGGTCGTGGT 350 

ACTGTAGTAA CCGGCCGTGT AGAGCGTGGT ATTGTACACG TAGGTGACGA 4 00 

AGTTGAAATC GTTGGTATTC GTCCAACCAC CAAGACCACT GTAACTGGCG 450 

TTGAAATGTT CCGTAAGTTA CTAGACGAAG GTCGTGCAGG TGATAACGTT 500 

GGTGTTCTAC TACGTGGTAC CAAGCGTGAT GAGGTTGAGC GTGGTCAGGT 550 

TCTAGCTGCT CCAGGCACAA TCACTCCACA CACCAAGTTC ACTGGTCAGG 600 

TTTACGTACT AAGCAAGGAT GAAGGTGGTC GTCACACTCC ATTCTTCAAG 650 

GGCTACCGTC CACAGTTCTT CTTCCGTACA ACCGATATTA CCGGTTCTAT 700 

CGATCTGAAA GAGGGCGTAG AGATGGTAAT GCCAGGTGAT AACACCGACA 750 

TGACCGTAAC CCTAATCCAC CCAGTAGCTA TGGCT 785 



2) INFORMATION FOR SEQ ID NO: 1675 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN : ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1675 

TGGGGCGATC TTGGTTGTTT CTGCTGCAGA TGGTCCTATG CCACAAACTA 50 

GAGAGCACAT TCTTCTTTCT CGTCAAGTAG GCGTTCCATA TATTGTTGTT 100 

TTTATGAATA AAGCAGATAT GGTTGATGAT GCTGAGCTTT TAGAGTTAGT 150 

TGAAATGGAA ATTAGAGAAT TATTAAGCTC TTATGATTTC CCAGGCGATG 200 

ATACACCTAT TATTTCTGGT TCTGCTTTAA AAGCTCTTGA AGAAGCTAAA 2 50 

GCTGGACAAG ATGGTGAATG GTCAGCAAAA ATTATGGATC TTATGGCTGC 300 

AGTTGATAGT TATATTCCAA CTCCAACTCG TGATACTGAA AAAGACTTCT 350 

TGATGCCAAT TGAAGACGTT TTCTCAATTT CAGGTCGTGG TACTGTTGTT 4 00 

ACAGGTAGAA TTGAAAAAGG TGTTGTAAAA GTAGGTGATA CTATCGAAAT 450 

CGTTGGTATT AAAGACACTC AAACAACAAC TGTAACAGGT GTTGAAATGT 500 

TCAGAAAAGA AATGGATCAA GGCGAGGCAG GAGATAACGT AGGTGTTCTT 550 

CTTCGTGGTA CTAAAAAAGA AGAAGTTATT CGCGGTATGG TTCTTGCTAA 600 

ACCAAAATCA ATTACTCCAC ACACTGACTT CGAAGCTGAA GTTTATATCT 650 

TAAATAAAGA TGAAGGTGGT AGACATACTC CATTCTTTAA CAACTATAGA 7 00 

CCACAGTTTT ATGTAAGAAC AACTGATGTT ACAGGTTCGA TTAAATTAGC 750 

TGATGGTGTT GAAATGGTTA TGCCAGGTGA AAATGTGAGA ATTACTGTAA 8 00 

GCTTGATCGC TCCAGTAGCA CTTGAAGAAG GAACT 835 



2 ) INFORMATION FOR SEQ ID NO: 167 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1676 

TTCTGCAGCA GACGGCCCTA TGCCACAAAC TAGAGAGCAT ATCTTACTTT 50 

CTCGTCAAGT AGGTGTACCA TATATTGTTG TTTTCATGAA CAAAGCTGAT 100 

ATGGTTGATG ATGCAGAATT ATTAGAATTA GTTGAAATGG AAATTAGAGA 150 

ATTACTAAGC TCTTATGATT TCCCAGGAGA TGATACTCCA ATTATTTCAG 200 

GTTCAGCATT ACAAGCTCTT GAAGAAGCAA AAGCTGGTCA AGATGGTGAA 250 

TGGTCTAAAA AAATCTTAGA TCTTATGGCT GCAGTTGATG ATTATATTCC 300 

AACTCCGGCT CGTGATACAG ATAAAGATTT CTTGATGCCA ATCGAAGATG 350 

TTTTCTCAAT CTCAGGTCGT GGAACTGTTG TTACCGGTAG AATTGAAAAA 4 00 

GGTGTTGTAA AAGTTGGTGA TACTATAGAA ATCGTTGGTA TTAGAGACAC 450 

TCAAACAACC ACAGTTACTG GTGTTGAAAT GTTTAGAAAA GAAATGGATC 500 

AAGGTGAAGC TGGTGATAAT GTTGGTGTAT TACTTCGTGG AACTAAAAAA 550 

GAAGATGTTG AACGTGGTAT GGTTCTTGCT AAACCAAAAT CAATCACTCC 600 

868 
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ACATACTGAT TTTGAAGCAG AAGTTTATAT CTTAAATAAA GATGAAGGTG 
GTCGTCATAC TCCATTCTTT AATAATTATA GACCGCAATT TTATGTAAGA 
ACAACTGATG TTACAGGTGC TATTAAACTT GCAGAAGGCG TTGAGATGGT 
TATGCCAGGC GATAATGATA GAATTACTGT AAGTCTTATT GCTCCAGTTG 
CACTTGAGGA AG 
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2) INFORMATION FOR SEQ ID NO: 1677 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 

(B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1677 

GCTATTCTTG TATGTTCAGC TGCAGATGGT CCAATGCCAC AGACTAGAGA 50 

GCATATTCTA CTATCAAGAC AAGTTGGTGT TCCATACATA GTTGTTTTCT 100 

TAAATAAAGA AGATATGGTT GATGATGCTG AGCTTATAGA GTTGGTTGAA 150 

GTTGAGGTTA GAGATTTATT AAATGAATAT GATTTCCCTG GAGATGATAC 200 

TCCAATCGTA ATAGGTTCTG CTCTTAAAGC TTTAGAAGAA GCAAAAGCTG 250 

GAACAGAGGG TGAATGGTCT GCTAAAATTA TGAAACTTAT GGATGCTGTT 300 

GATAGCTATA TCCCAACTCC AACAAGAGAT ACAGATAAAG ATTTCCTTAT 350 

GCCAATCGAA GATATCTTCT CAATTTCTGG TCGTGGTACA GTTGTAACAG 400 

GTAGAATTGA AAAAGGTGTA GTAAAAGTTG GCGAGACTAT TGAGATAGTT 4 50 

GGTATTAGAC CTACTCAAAC AACAACAGTT ACTGGTGTTG AAATGTTTAG 500 

AAAAGAGCTA GATCAAGGTG AAGCTGGAGA TAATGTTGGT ATCTTGTTAA 550 

GAGGTACAAA AAAAGAAGAT GTTGAAAGAG GTATGGTTTT ATGTAAACCA 600 

AAATCAATCA CTCCTCACAA GAAATTTGAA GGCGAAGTTT ATATTCTTAC 650 

AAAAGATGAA GGTGGTAGAC ATACTCCTTT CTTTAGTAAC TATAGACCAC 700 

AATTTTATGT TAGAACAACA GATGTAACAG GTTCTATATC TCTTCCTGAG 750 

GGAACAGAGA TGGATATGCC TGGTGATAAT GTAAAACTTA CAGTTGAACT 800 

TATAAACCCA ATTGCTCTTG AGCAAGGA 828 



2) INFORMATION FOR SEQ ID NO: 1678 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter upsaliensis 

(B) STRAIN: ATCC 49815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1678 



CGATTTTGGT TGTTTCTGCT GCTGATGGTC CTATGCCACA AACTAGAGAG 50 

CACATTTTGC TTTCTCGTCA AGTGGGTGTT CCTTATATCG TAGTTTTTAT 100 

10 GAATAAGGCT GATATGGTTG ATGATGCAGA GCTTTTAGAA TTGGTTGAAA 150 

TGGAAATTAG AGAACTTTTA AGCTCTTATG ATTTCCCGGG CGATGACACT 2 00 

CCTATCATTT CAGGCTCTGC TCTTCAAGCC TTAGAAGAGG CTAAGGCGGG 250 

ACAAGATGGC GAGTGGTCAG CTAAGATTTT AGAGCTTATG AAGGCAGTTG 3 00 

ATGAGTATAT CCCAACTCCT GTTCGCGATA CTGAAAAAGA TTTCTTGATG 350 

15 CCTATTGAAG ATGTTTTTTC AATTTCTGGT CGTGGAACTG TTGTAACAGG 4 00 

TAGAATTGAA AAAGGTGTGG TTAAAGTCGG CGATACTATT GAGATAGTAG 4 50 

GTATCAAAGA TACTCAAACT ACAACAGTTA CAGGCGTTGA GATGTTTAGA 500 

AAAGAAATGG ATCAAGGTGA GGCTGGCGAT AATGTCGGTG TGCTTTTAAG 550 

AGGAACAAAA AAAGAAGATG TTCTTCGTGG TATGGTTCTT GCAAAGCCTA 600 

20 AATCTATCAC TCCTCATACT GATTTTGAAG CAGAAGTTTA TATTCTAAAT 650 

AAAGATGAGG GCGGTCGCCA TACTCCTTTC TTTAACAATT ATCGTCCGCA 700 

GTTTTATGTA AGAACGACTG ATGTAACTGG TTCTATTAAA TTAGCTGATG 750 

GTGTTGAGAT GGTTATGCCG GGTGAAAATG TAAGAATTAC AGTTAGCCTT 800 

ATCGCTCCAG TTGCACTTGA 820 

25 



2) INFORMATION FOR SEQ ID NO: 1679 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 
40 (B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1679 

CTGCAGCTGA TGGTCCAATG CCTCAAACAC GTGAACATAT CTTATTATCA 50 

45 CGTCAAGTAG GTGTTCCTTA CATGGTTGTC TTCTTAAACA AAGTTGACAT 100 

GGTTGACGAT GAAGAATTAT TAGAATTAGT TGAAATGGAA GTTCGTGATT 150 

TATTATCTGA ATACGAATTC CCTGGAGACG ACGTTCCAGT AATCGCTGGT 200 

TCAGCTTTAA AAGCTTTAGA AGGCGAAGAA CAATATGAAG CAAAAGTATT 250 

AGAATTAATG GAAGCTGTAG ATACATACAT TCCAGAACCA GTTCGTGATA 300 

50 CTGAAAAACC ATTCATGATG CCAGTTGAAG ATGTGTTCTC AATCACAGGT 350 

CGTGGTACAG TTGCTACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 400 

TGACGAAGTT GAAATCGTTG GTATTGCTGA AGAAATTAGC AAAACAACTG 450 

TAACTGGTGT TGAAATGTTC CGTAAATTAT TAGATTACGC TGAAGCTGGA 500 

870 
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TGGTCAAGTA TTAGCAAAAC CAGGAACAAT 
AGGCGGAAGT TTACGTATTA TCAAAAGAAG 
TTCTTCGCTA ACTACCGTCC TCAATTCTAC 
5 AGGTGTTGTA GAATTACCAG AAGGTACAGA 
ACGTATCAAT GACAGTTGAA TTAATTCACC 
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ACACGTGAAC AAATCCAACG 550 

TACACCTCAT ACTAAATTCG 600 

AAGGTGGACG TCATACTCCA 650 

TTCCGTACAA CTGACGTTAC 700 

AATGGTAATG CCTGGAGATA 750 

CAA 783 



10 2) INFORMATION FOR SEQ ID NO: 1680 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE : ■ Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1680 



GCTATCTTAG TTGTTGCTGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

ACACATTTTG CTTGCTCGTC AAGTTGGTGT TAACTACATC GTAGTATTCT 100 

TGAACAAGTG CGATTTAGTT GACGACCCAG AATTGATCGA CTTGGTTGAA 150 

ATGGAAGTTC GTGACTTGTT GACTGAATAC GATTACCCTG GTGATGATAT 200 

30 TCCAGTTGTT CGTGGTTCAG CATTAAAGGC TTTACAAGGT GACAAGGAAG 250 

CTCAAGACCA AATCATGAAG TTGATGGACA TTGTTGATGA ATACATCCCA 300 

ACTCCAGAAC GTCAAACTGA CAAGCCATTC TTGATGCCAG TTGAAGACGT 350 

ATTCACTATC ACTGGTCGTG GTACTGTTGC TTCAGGTCGT ATCGACCGTG 4 00 

GTACTGTTAA GGTCGGTGAC GAAGTTGAAA TCGTTGGTTT GGTAGATAAA 450 

35 GTTCTTAAGT CAGTTGTTAC TGGTTTGGAA ATGTTCCACA AGACTTTGGA 500 

CTTAGGTGAA GCCGGCGATA ACGTTGGTGT ATTGCTTCGT GGTGTTGACC 550 

GTGATCAAGT TGTTCGTGGT CAAGTATTGG CTGCACCCGG CTCAATCCAA 600 

ACTCATAAGA AGTTTAAGGC ACAAGTTTAT GTTTTGAAGA AGGACGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCTCAGACTA CCGTCCACAA TTCTACTTCC 700 

40 ACACCACTGA TATTACTGGT GAAATTGAAT TGCCAGAAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACAC TGAATTCACT GTTACTTTGA TCAAGCCAGC 800 

TGCCATCGAA AAGGGTACTA AGT 823 



45 

2) INFORMATION FOR SEQ ID NO: 1681 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 793 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leuconostoc mesenteroides subsp. 
5 dextranicum 

(B) STRAIN: ATCC 19255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1681 

10 GCTGCAACTG ATGGTCCTAT GCCACAAACA CGTGAACACA TCTTGTTGGC 50 

ACGTCAAGTT GGTGTTGACT ACTTGGTTGT CTTCTTGAAC AAGACTGATT 100 

TGGTTGATGA TGAAGAATTA GTTGAATTGG TTGAAATGGA AGTTCGTGAA 150 

TTGTTGTCAG AATATGACTT CCCAGGTGAC GATATTCCTG TACTTAAGGG 200 

TTCAGCTTTG AAGGCTTTGG AAGGTGATCC TGAACAAGTT AAGGTTATCG 250 

15 AAGAATTGAT GGATACTGTT GATTCATACA TTCCAGAACC AGCACGTGAA 300 

ACAGACAAGC CATTCTTGAT GCCTGTCGAA GACGTCTTCA CAATCACTGG 3 50 

TCGTGGTACA GTTGCTTCTG GTCGTGTTGA CCGTGGTGTA TTGACTACAG 4 00 

GAACTGAAAT TGAAATCGTT GGTTTGAAGG ACGAAGTTCA AAAGACTACT 4 50 

GTTACAGGTA TCGAAATGTT CCGTAAGACT TTGGAAGAAG CTCAAGCGGG 500 

20 TGATAACATT GGTGCATTGT TGCGTGGTGT TGATCGTAGC AACATTGAAC 550 

GTGGTCAAGT TTTGGCAAAG CCAGGTTCAA TTAAGACACA CAAGAAGTTC 600 

AAGGCTGAAG TTTATGTCTT GACAAAGGAA GAAGGTGGTC GTCATACACC 650 

ATTCTTCACT AACTACCGTC CACAATTCTA CTTCCACACA ACTGATGTTA 700 

CAGGTGTTGT TGAATTGCCA GCCGGTGTTG AAATGGTTAT GCCTGGTGAC 750 

25 CAAGTGACAT TCGAAATCGA ATTGATCTCA CCAGTTGCCA TCG 793 



2) INFORMATION FOR SEQ ID NO: 1682 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1682 

45 TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 200 

TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

50 GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 3 00 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 4 00 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 
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GTCTGTTGTA ACAGGCGTTG 
AAGCAGGTGA TAACGTAGGA 
GTAAAGCGTG GTATGGTTGT 
TCACTTCAAG GCTTCCATCT 
ATACTCCATT CGGAAACAAG 
GACTGTACAG GTGAGATCAC 
TGGTGACAAC GTAGAGATTG 



AAATGTTCCG TAAGATTCTT 
CTTTTGCTCC GCGGTATCGA 
TGTACACCCA GGTGCCATCA 
ATGTATTGAA GAAGGAAGAG 
TATCGTCCTC AGTTCTATCT 
TTTGCCAGAA GGCGTAGAGA 
AGGTTACCTT GATTTACAAG 



TCTGAAGGTG 
TAAGGATGAA 
CTCCTCACGA 
GGTGGACGTC 
CCGTACAATG 
TGGTGATGCC 
GTTGCC 



500 
550 
600 
650 
700 
750 
796 



2) INFORMATION FOR SEQ ID NO: 1683 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ruminococcus bromii 

(B) STRAIN: ATCC 27255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1683 

GGTTGCTGCT ACTGACGGCC CGATGCCTCA GACTCGTGAG CACGTTCTGC 50 

TCGCTCGTCA GGTGGGTGTG CCCGCCATCG TCGTCGCCCT CAACAAGTGC 100 

GACATGGTTG ACGATGAGGA GCTCATTGAG CTTGTCGAGA TGGAGGTCCG 150 

CGAGCTGCTG ACCTCGCAGG AGTTCGACGG CGACAACTGC CCTGTCGTTC 200 

GCATCTCCGC CTTCCAGGCC CTCCAGGGTG ACGAGAAGTG GACCCAGTCG 250 

ATCCTCGACC TCATGGACGC CGTGGACGAG TACATCCCGC AGCCTGAGCG 300 

CGATCTCGAC AAGCCCTTCC TTATGCCGAT CGAGGACGTC TTCACCATCA 350 

CCGGCCGTGG CACCGTTGTC ACCGGTCGTG TCGAGCGTGG TGTCGTCAAG 4 00 

ACTGGCGAAG AGGTCGAGAT CGTCGGTATC CACGAGAAGA CCCAGAAGAC 450 

CACCGTTACC GGTGTCGAGA TGTTCCGTAA GATCCTCGAC GAGGGCCGCG 500 

CTGGTGAGAA CGTCGGCGTT CTGCTCCGTG GCACCAAGAA GGAGGACGTC 550 

GTTCGCGGCA TGGTCCTCTC CAAGCCTGGT TCCACCACCC CCCACACCGA 60 0 

CTTCGAGGGC CAGGTCTACG TCCTCAAGAA GGATGAGGGT GGCCGCCACA 650 

AGCCGTTCTT CTCCCATTAC AGCCCCCAGT TCTACTTCCG TACCACGGAC 700 

GTGACTGGCA CTGTTGAGCT CCCCGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACACC GACATGACTG TGCACCTGAT TCACCCGGTT GCCATGGAGG 800 



2) INFORMATION FOR SEQ ID NO: 1684 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
5 (B) STRAIN : ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1684 

TGTCTTTATT CAGGAACTGA TTGTATGTTT CTTCTCGTTT ATATATAACA 50 

10 TACCTTCTAT ATTTCATGTG TTTCTAACGA ACTCATAGAA CAACATTGCC 100 

AAGGCTCACG GTGGTTACTC CGTGTTCACC GGTGTCGGTG AGCGTACCCG 150 

TGAAGGAAAC GATCTGTACC ACGAAATGCA GGAAACCCGC GTCATCCAGC 200 

TGGACGGCGA GTCCAAAGTC GCCCTCGTCT TCGGCCAGAT GAACGAGCCC 250 

CCCGGAGCCC GTGCCCGTGT TGCCCTGACC GGTCTGACCA TCGCTGAATA 300 

15 CTTCCGTGAC GAAGAAGGCC AAGATGGTAC GTTCCCCCAT TCCATATATG 350 

TTTCTTGTGC GCTTTGCCAA CTAAACACCA CCTAGTGCTC CTCTTCATCG 400 

ACAATATCTT CCGCTTCACC CAAGCCGGTT CCGAAGTGTC CGCCCTGCTA 450 

GGCCGCATCC CCTCCGCCGT CGGCTATCAA CCCACCCTCG CCGTCGACAT 500 

GGGTGGTATG CAGGAGCGCA TCACAACTAC AACAAAAGGC TCCAT 54 5 

20 



2) INFORMATION FOR SEQ ID NO: 1685 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 
35 (B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1685 

GATATCGCTT TATGGAAATT CGAAACTCCA AAATTCCACG TTACYGTTAT 50 

4 0 CGATGCTCCA GGTCACAGAG ATTTCATCAA GAATATGATT ACYGGTACCT 100 

CCCAAGCTGA TTGTGCTATT TTAATCATTG CTGGTGGTGT TGGTGAATTC 150 

GAAGCTGGTA TCTCCAAAGA TGGTCAAACC AGAGAACACG CTTTGTTAGC 200 

TTTCACCTTA GGTGTYAAAC AATTGATTGT TGCCGTTAAC AAAATGGACT 250 

CTGTCAAATG GGATCAATCC CGTTTCGAAG AAATCGTCAA GGAAGCTTCC 300 

45 GGTTTCATCA AGAAAGTTGG TTACAACCCA AAGACTGTTC CATTCGTTCC 350 

AATCTCTGGT TGGAATGGTG ACAACATGAT TGAAGTYTCW GCTAACGCYC 400 

CATGGTACAA AGGTTGGGAA AAGGAAACCA AAGCYGGTGT CGTTAAAGGT 450 

AAAACTTTAT TAGAAGCCAT TGATGCTATT GAACCACCTT CAAGACCAAC 500 

TGAAAAACCA TTGAGATTGC CATTGCAAGA TGTCTACAAG ATTGGTGGTA 550 

50 TCGGAACCGT ACCAGTCGGT ARAGTTGAAA CCGGTGTCAT TAAACCAGGT 600 

ATGATTGTTA CTTTCGCCCC AGCCGGTGTT ACTACTGAAG TCAAATCTGT 650 

TGAAATGCAT CACGAACAAT TAGAAGCTGG TTACCCAGGT GACAATGTTG 700 

GTTTCAACGT CAAGAATGTT TCAGTTAAAG AAATCAGAAG AGGTAAHGTT 750 
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GCTGGTGACT CCAAGAACGA TCCACCAAAA 

TCAAGTTATT GTCTTGAACC ATCCAGGTCA 

CAGTTTTGGA TTGTCACACT GCCCACATTG 

TTGGAAAAGA TTGACAGAAG ATCCGGTAAG 

5 ATTCATCAAA TCTGGTGACG CTGCTAWTGT 

CATTRTGTGT TGAAGCTTTC 



PCT/CA00/01150 

GGTGCTGAAT CTTTCAACGC 800 
AATCTYTGCT GGTTACTYTC 850 
CTTGTAAATT CGATGAAATC 900 
AAATTGGAAG AAAATCCAAA 950 
CAAATTTGTT CCATWTAAAC 1000 

1020 



10 2) INFORMATION FOR SEQ ID NO: 1686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 929 bases 

(B) TYPE: ■ Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1686 



TGCTGCTTCC GATGGTCAAA TGTACGATTG ATATTCCTTC CAGCCAGTCA 50 

GGATAACAGC TGATACCAGT TGCAAATAGG CCCCAGACTC GTGAGCACTT 100 

GTTGCTTGCC CGTCAGGTTG GTGTCCAGAA GATCGTTGTC TTCGTCAACA 150 

■ AGGTTGACGC TGTCGATGAC CCTGAGATGT TGGAGCTTGT TGAGCTCGAG 200 

30 ATGCGTGAGC TCCTCAACAC TTACGGTTTC GAGGGAGAGG AGACCCCTAT 250 

CATCTTCGGT TCCGCCCTGT GCGCTCTCGA AGGCCGCCGC GAGGACATTG 300 

GTACTCAGCG TATTGACTCC CTCCTCGAGG CCGTTGACAC TTGGATCCCT 350 

ACCCCCCAGC GTGACTTGGA CAAGCCCTTC CTGATGTCCA TTGAGGAAGT 4 00 

TTTCTCCATT GGTGGTCGTG GTACCGTCGC CTCTGGTCGT GTCGAGCGTG 4 50 

35 GTCTCCTCAA GAAGGATACC GAAGTTGAAA TTCACGGTGC TGATGGTATT 500 

CTGAAGACCA AGGTCACCGA CATTGAGACC TTCAAGAAGA GCTGCGATGA 550 

GTCTCGTGCT GGTGACAACT CCGGTCTTCT CCTCCGTGGT ATCCGTCGTG 600 

AGGATGTTCG TCGTGGTATG GTCATCGCTG CCCCTGGCTC CATCAAGGCC 650 

TCCAAGAAGT TCATGGTCTC CATGTACGTC TTGACTGAGG CTGAAGGTGG 700 

4 0 CCGCAAGAAC GGCTTCGGTG CCAACTACCG CCCCCAGGCT TTCATCCGCA 750 

CTGCTGGTAA GTTTCGAACT ATTTGATTCA TTGATCACGT CCCTAACTGT 800 

TACTTTAGAC GAGGCTTGCG ACCTTCATTT CCCTGATGAG GCCGACAAGG 850 

ACCGCCACGT CATGCCCGGT GACAACGTCG AAATGGTCCT CAACCTCAAC 900 

AACCCCGTTG CTGCTGAGGC TGGACAGCG 929 

45 



2) INFORMATION FOR SEQ ID NO: 1687 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 

(B) STRAIN: WSA-174 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1687 



TGCCGCTTCC GATGGTCAGA TGTACGCTCA AGCCCCAGTT TCCATATAAA 50 

CATAAACGAT CTATCATCAG CACAACGCTG ACTTCTTCGC TTCCAGGCCC 100 

CAGACCCGTG AGCACTTGCT GTTGGCCCGT CAGGTCGGTG TCCAGAAGAT 150 

CGTGGTCTTC GTCAACAAGG TCGATGCCGT TGATGACCCG GAGATGTTGG 200 

15 AGCTCGTTGA GCTGGAAATG CGCGAGCTCC TGACCAGCTA CGGATTCGAG 250 

GGTGAAGAGA CCCCCATCAT CTTCGGTTCT GCTCTCTGCG CTCTTGAGGG 300 

CCGCCGTCCT GAGATTGGTA CTGAGAAGAT TGACGAGCTG ATGCACGCCG 350 

TCGACACCTG GATCCCCACC CCCCAGCGTG ACCTCGACAA GCCCTTCCTG 400 

ATGTCCGTCG AGGAAGTCTT CTCCATTGCT GGTCGTGGTA CCGTCGCTTC 4 50 

20 CGGCCGTGTC GAGCGTGGTA TTCTGAAGAA GGATAGCGAA GTCGAGATCA 500 

TCGGTGGTGC TTTCGACGCC ACGAAGACCA AGGTCACTGA CATCGAGACC 550 

TTCAAGAAGT CTTGCGACGA GTCTCGCGCT GGTGACAACT CTGGTCTCCT 600 

CCTCCGTGGT ATCCGTCGTG AGGATGTTCG GCGTGGTATG GTCATTGCTG 650 

CTCCTGGCAG CACCAAGGCC CACGACAAGT TCCTTGTCTC TATGTACGTC 700 

25 CTCACTGAGG CTGAGGGTGG CCGTCGTACC GGATTCGGTA CCAACTACCG 750 

CCCCCAGGTC TTCATCCGTA CTGCCGGTAA GTGTTCCTGG AAGAGGCTTT 800 

GAGCCTATAT AGGATCTCGG ATAATTTACT AATCCACCAT ATAGATGAGG 850 

CCGCTGACCT CAGCTTCCCC GACAACGATG ACTCCCGCCG TGTCATGCCC 900 

GGTGACAACG TTGAGATGGT CCTGAAGACC CACCGCCCCG TGGCTGCTGA 950 

30 G 951 



2) INFORMATION FOR SEQ ID NO: 1688 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1688 

50 CGTTGCCGCT ACCGATGGTC AAATGCCTCA AACTAGAGAA CATTTGCTAT 50 

TGGCTAGACA GGTTGGTGTT CAACACATTG TCGTGTTTGT TAACAAGGTT 100 

GACACTATTG ATGATCCAGA AATGTTGGAA TTGGTTGAAA TGGAAATGAG 150 

AGAGTTGATT GCCACTTATG GTTTCGATGG TGATAACACC CCAGTTATCA 200 
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TGGGTTCTGC 
CAATCAATCG 
AACTAGAGAT 
CCATTTCTGG 
TTGAAGAAAG 
CAAAACCACC 
CTATGGCTGG 
GATATCAAGA 
CACCAAATTC 
GTCATTCAGG 
GGTGATGTCA 
GCAAGTCTTA 
CAACTGCTCT 



TCTATGTGCT 
ACAGATTGTT 
TTGGAAAAAC 
TAGAGGTACC 
GTGATGAAAT 
GTCACCGGTA 
TGATAACTGT 
GAGGTATGGT 
TTGGCCTCTT 
TTTTGCTGAA 
CTGTTACTTT 
CCAGGTGACA 
TGAAACCGGT 



TTGGAAGGTC GTGAACCTGA AATCGGTGCT 
GGAAGCCGTT GATGAATACA TTCCAACTCC 
CATTCTTGAT GGGTGTTGAA GATGTCTTCT 
GTCTGTACCG GTCGTGTTGA AAGAGGTAAC 
CGAAATTGTC GGCTACAACA AGACTCCAAT 
TTGAGATGTT CAAAAAGGAA TTAGACCAAG 
GGTATCTTAT TACGTGGTGT TAAGAGAGAT 
TATCTCTAAA GTCAACACCG TTTCCGCACA 
TATACGTCTT GACTAAAGAA GAAGGTGGTC 
AACTACAGAC CTCAATTGTT CATCAGAACC 
AACCTTCCCA GAAGATGCTG ATCACTCTCA 
ACGTTGAAAT GGAATGTACC TTGGTTCATC 
CAA 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
823 



2) INFORMATION FOR SEQ ID NO: 1689 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 20107 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1689 

GCTGCTACTG ACGGTCAAAT GCCTCAAACT AGGGAACATA TGTTGTTGGC 50 

GAGACAAGTT GGTATCCAAA ACTTGGTTGT TTTTGTTAAC AAAGTTGATA 100 

CCATTGATGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGGGAA 150 

TTATTGAGCT CTTATGGGTT TGATGGTGAA AACACTCCAG TTATCATGGG 200 

ATCAGCCTTG TGTGCTTTAG AAGGTAAACA ACCAGAAATC GGTGTTCAAG 250 

CCATTCAAAA ATTATTGGAT GCTGTTGATG AATATATTCC AACTCCAGAA 300 

AGAGATGCTG ACCAACCATT TTTGATGCCA GTGGAAGATG TGTTTTCTAT 350 

TTCAGGTAGA GGAACCGTTG TCACCGGAAG AGTTGAAAGA GGTATGTTGA 4 00 

AGAAAGGTGA AGAAGTAKAA GTCATTGGTG AAAACTCATT TAAGGCTACT 4 50 

TCCACGGGTA TTGAGATGTT CAAAAAGGAA TTGGATGCCG CTATGGCCGG 500 

TGACAACTGT GGTATTTTGT TGAGAGGTGT CAAGAGAGAC GAAGTCAAGA 550 

GGGGTATGGT TTTGGCCAAA CCAGGTACCA CCACCCCACA CCAAAAGTTT 600 

TTGGCTTCCA TTTATATCTT GACTGCTGAA GAAGGTGGAC GTAGTACCCC 650 

TTTCAGTGAA GGATACAAAC CACAATGTTT CTTTAGAACT AGTGATGTTA 700 

CCACGACATT TACTTTCCCA GAAGGTGAAG GTGTTGACCA CTCACAAATG 750 

GTTATGCCAG GAGRCAATGT TGAAATGGTG GGAACTTTAA TCAAGAAAGC 800 

TCC 803 



2) INFORMATION FOR SEQ ID NO: 1690 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1690 
CAGGTCCTGT TGCGACTGAA GAA 



2) INFORMATION FOR SEQ ID NO: 1691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii)- MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1691 
CACAGATAAA CCTGAGTGTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1692 
GGTGAGAACT GTGGTATCTT ACTT 



2) INFORMATION FOR SEQ ID NO: 1693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1693 
CATTTCAACG CCTTCTTTCA ACTG 



2) INFORMATION FOR SEQ ID NO: 1694 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1694 
AAGGCAAGGA TGACAACGGC 



2) INFORMATION FOR SEQ ID NO: 1695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1695 
ACGATTTCCA CTTCTTCCTG G 



2) INFORMATION FOR SEQ ID NO: 1696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1696 
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ATGTTCCTGT AGTTGCTGGA 20 



2) INFORMATION FOR SEQ ID NO: 1697 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1697 
TTTCTTCAGC AATACCAACA AC 22 

20 

2) INFORMATION FOR SEQ ID NO: 1698 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1698 
35 GGAATCAACA GATGGTTTAC AAA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1699 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1699 

50 

GCATCTTCTG GGAAAGGTGT 20 
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2) INFORMATION FOR SEQ ID NO: 1700 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1700 
AAGATGCGGA AAGAAGCGAA 



2) INFORMATION FOR SEQ ID NO: 1701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1701 
ATTATGGATC AGTTCTTGGA TCA 



2) INFORMATION FOR SEQ ID NO: 1702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: Challis V288 

(C) ACCESSION NUMBER: L20574 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1702 

TTCATAGACG CTGAGCACGC TTTGGATCCA TCTTACGCGG CTGCTCTAGG 
TGTAAATATT GATGAGCTGT TGCTATCTCA ACCAGATTCT GGTGAGCAAG 
GTTTAGAAAT TGCAGGAAAA TTGATTGACT CTGGGGCAGT TGATTTAGTT 
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GTCATCGACT CTGTTGCAGC TCTTGTACCA CGTGCGGAAA TCGATGGAGA 200 
•TATCGGTGAT AGC 213 



2) INFORMATION FOR SEQ ID NO: 1703 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: strain GS-5 

(C) ACCESSION NUMBER: M61897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1703 



GGGCCGGAAT 
GGCGCAAAAA 

2 5 TTGATCCAGC 
CTTTCACAAC 
GATTGATTCT 
TAGTACCACG 
TTACAAGCAC 

30 CAATAAAACA 
TTGGTATTAT 
AAGTTTTATT 
AGGAACCGGG 
AAGTTGTTAA 

35 ATTATATATG 
CAGTGATTTG 



CTTCTGGTAA 
GATGGCGGTA 
CTATGCTGCT 
CAGATTCAGG 
GGCGCTGTTG 
TGCGGAGATT 
GCATGATGAG 
AAAACCATTG 
GTTTGGTAAT 
CTTCTGTGCG 
GAACAAAAAG 
AAATAAAGTT 
GTGAAGGCAT 
GGAATTATCC 



GACAACTGTC 
TTGCCGCTTT 
GCTCTTGGCG 
AGAACAGGGT 
ATTTAGTTGT 
GACGGAGATA 
TCAAGCGATG 
CTATTTTTAT 
CCAGAAACAA 
TCTTGATGTC 
ACAGCAATAT 
GCTCCACCAT 
TTCTCGTACA 
AAAAAGCTGG 



GCTCTTCATG 
CATTGATGCA 
TTAATATTGA 
CTTGAAATTG 
TGTTGACTCA 
TTGGTAATAG 
CGTAAATTAT 
TAATCAATTG 
CCCCTGGCGG 
CGCGGCAATA 
TGGTAAAGAG 
TTAAGGAAGC 
GGTGAATTAG 
AGCTTGGTAC 



CTGCTGCTCA 50 

GAACATGCCC 100 

TGAGCTTTTG 150 

CAGGGAAATT 200 

GTGGCAGCTT 250 

TCATGTTGGC 300 

CAGCTTCAAT 350 

CGGGAAAAAG 4 00 

GCGTGCCTTG 450 

CTCAAATTAA 500 

ACCAAAATTA 550 

TTTTGTAGAA 600 

TTAAGATTGC 650 

TC 692 



40 2) INFORMATION FOR SEQ ID NO: 1704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(C) ACCESSION NUMBER: Z17307 
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ATGGCGAAAA 
AGAACGTGAA 
5 TTGGTAAAGG 
CAAGTGATGA 
TGGTTATCCT 
GTAAGACAAC 
GGGATTGCTG 

10 TGCGGCCCTT 
CAGGAGAGCA 
GTTGATCTTG 
AATTGATGGA 
TGAGCCAGGC 

15 ATTGCCATTT 
AAATCCAGAA 
TCCGCTTGGA 
AAAGAAACCA 
GGTAGCTCCA 

20 GAATTTCTAA 
ATCAAAAAAG 
AGGTTCTGAG 
ATGAAATTGA 
GAAGTTTCAG 

25 AGAAGCAGTG 
TCGAAATTGA 
TCGA 



AACCAAAAAA 
AAGGCCTTGA 
ATCAATCATG 
GCTCAGGTTC 
AAGGGACGTA 
GGTTGCCCTT 
CCTTTATCGA 
GGTGTCAATA 
AGGTCTTGAG 
TCGTAGTCGA 
GATATCGGAG 
CATGCGTAAA 
TTATCAACCA 
ACAACACCGG 
TGTTCGTGGT 
ATGTCGGTAA 
CCGTTTAAGG 
GACTGGTGAG 
CAGGGGCTTG 
AATGCTAAGA 
TAAGCAAGTC 
AACAAGATAC 
AATGAAGAAG 
AGAATAAGCT 



ATTAGAAGAA 
ATGACGCTCT 
CGTTTGGGTG 
TTTAGCTCTT 
TCATCGAAAT 
CATGCAGTTG 
TGCGGAACAT 
TTGACGAATT 
ATTGCGGGAA 
CTCAGTTGCT 
ATAGCCATGT 
CTTGGCGCCT 
ATTGCGTGAA 
GCGGACGTGC 
AATACACAAA 
AGAAACTAAG 
AAGCCGTAGT 
CTTTTGAAGA 
GTATTCTTAC 
AATACTTGGC 
CGTTCTAAAT 
TGAAAACAAA 
TTCCGCTTGA 
GTTAAAGCAG 



ATTTCAAAAA 
TAAATTGATT 
AACGTGCGGA 
GACATTGCCC 
CTATGGCCCA 
CACAAGCGCA 
GCCCTTGATC 
GCTCTTGTCT 
AATTGATTGA 
GCCCTTGTTC 
TGGTTTGCAG 
CTATCAATAA 
AAAGTTGGAG 
TTTGAAATTC 
TTAAGGGAAC 
ATTAAGGTTG 
TGAAATTATG 
TTGCAAGCGA 
AAAGATGAAA 
AGAGCACCCA 
TTGGCTTGAT 
AAAGATGAGC 
CTTAGGCGAT 
TGGAGAAATC 



AATTTGGGGC 


50 


GAGAAAGACT 


100 


GCAAAAGGTG 


150 


TTGGCTCAGG 


200 


GAGTCATCTG 


250 


AAAAGAAGGT 


300 


CAGCTTATGC 


350 


CAACCAGACT 


400 


CTCAGGTGCA 


450 


CTCGTGCGGA 


500 


GCTCGTATGA 


550 


AACCAAAACA 


600 


TGATGTTTGG 


650 


TATGCTTCAG 


700 


TGGTGATCAA 


750 


TAAAAAATAA 


800 


TACGGAGAAG 


850 


TTTGGATATT 


900 


AAATTGGGCA 


950 


GAAATCTTTG 


1000 


TGATGGAGAA 


1050 


CAAAGAAAGA 


1100 


GAACTTGAAA 


1150 


CGCTACTTTT 


1200 




1204 



30. 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1705 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: NZ131 

(C) ACCESSION NUMBER: U21934 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1705 

ATGCGTTCAG GAAGTCTAGC TCTTGATATT GCTTGGATAG 

TCCTAAAGGA CGTATCATCG AAATCTATGG TCCAGAGTCT 

CGACTGTGGC TTTACATGCT GTAGCACAAG CTCAAAAAGA 

GCAGCCTTTA TCGATGCCGA GCATGCGCTT GATCCAGCTT 

GCTTGGGGTT AATATTGATG AACTTCTCTT GTCTCAACCA 

AACAAGGACT TGAAATTGCA GGTAAATTGA TTGATTCTGG 



CTGGTGGTTA 50 

TCCGGTAAAA 100 

AGGTGGAATC 150 

ATGCTGCTGC 200 

GATTCTGGAG 250 

TGCGGTTGAC 300 
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CTGGTTGTTG 


TCGATTCAGT 


AGCAGCTTTA 


GTGCCACGTG 


CTGAAATTGA 


350 


TGGTGATATT 


GGCGATAGCC 


ATGTCGGATT 


GCAAGCACGT 


ATGATGAGTC 


400 


AGGCCATGCG 


TAAATTATCA 


GCTTCTATTA 


ATAAAACAAA 


AACTATCGCA 


450 


ATCTTTATCA 


ACCAATTGCG 


TGAAAAAGTT 


GGTGTGATGT 


TTGGAAATCC 


500 


TGAAACAACA 


CCAGGTGGTC 


GAGCTTTGAA 


ATTCTATGCT 


TCTGTTCGGC 


550 


TGGATGTGCG 


TGGAAACAAC 


CAAATTAAAG 


GAACTGGTGA 


CCAAAAGATA 


600 


GCCAGCATTG 


GTAAGGAGAC 


CAAAATCAAG 


GTTGTTAAAA 


ACAAGGTCGC 


650 


TCCGCCATTT 


AAGGTAGCAG 


AAGTTGAAAT 


CATGTATGGG 


GAAGGTATTT 


700 


CTCGTACAGG 


GGAGCTTGTG 


AAAATTGCTT 


CTGATTTGGA 


CATTATCCAA 


750 


AAAGCAGGTG 


CTTGGTTCTC 


TTATAATGGT 


GAGAAGATTG 


GCCAAGGTTC 


800 


TGAAAATGCT 


AAGCGTTATT 


TGGCCGATCA 


TCCACAATTG 


TTTGATGAAA 


850 


TCGACCGTAA 


AGTACGTGTT 


AAATTTGGTT 


TGCTTGAAGA 


AAGCGAAGAA 


900 


GAATCTGCTA 


TGGCAGTAGC 


ATCAGAAGAA 


ACCGATGATC 


TTGCTTTAGA 


950 


TTTAGATAAT 


GGTATTGAAA 


TTGAAGATTA 


A 




981 



15 



2) INFORMATION FOR SEQ ID NO: 1706 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarivs subsp, 
30 thermophilus 

(C) ACCESSION NUMBER: M94062 



(xi) SEQUENCE DESCRIPTION: SEQ I 

35 GCGTATGCAC GAGCTCTAGG TGTTAATATC 

GCCTGATTCT GGTGAGCAAG GTCTCGAAAT 

CTGGTGCAGT GGATTTAGTT GTTGTTGACT 

CGTGCAGAAA TTGATGGAGA TAGTGGTGAC 

GCGTATGATG AGTCAAGCCA TGCGTAAACT 

4 0 CAAAAACGAT TGCTATCTTT ATTAACCAGT 

ATGTTTGGTA AC 



NO: 1706 

GATGAGCTTC TTTTGTCGCA 50 

TGCAGGTAAG CTGATTGACT 100 

CAGTTGCGGC CTTCGTACCA 150 

AGTCATGTAG GACTTCAAGC 200 

TTCTGCATCT ATTAATAAAA 250 

TGCGTGAAAA AGTTGGTATC 300 

312 



45 2) INFORMATION FOR SEQ ID NO: 1707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J02967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1707 



ATGAAAAACA 


CAATACATAT 


■ CAACTTCGCT 


ATTTTTTTAA 


TAATTGCAAA 


50 


TATTATCTAC 


AGCAGCGCCA 


GTGCATCAAC 


AGATATCTCT 


ACTGTTGCAT 


100 


CTCCATTATT 


TGAAGGAACT 


GAAGGTTGTT 


TTTTACTTTA 


CGATGCATCC 


150 


ACAAACGCTG 


AAATTGCTCA 


ATTCAATAAA 


GCAAAGTGTG 


CAACGCAAAT 


200 


GGCACCAGAT 


TCAACTTTCA 


AGATCGCATT 


ATCACTTATG 


GCATTTGATG 


250 


CGGAAATAAT 


AGATCAGAAA 


ACCATATTCA 


AATGGGATAA 


AACCCCCAAA 


300 


GGAATGGAGA 


TCTGGAACAG 


CAATCATACA 


CCAAAGACGT 


GGATGCAATT 


350 


TTCTGTTGTT 


TGGGTTTCGC 


AAGAAATAAC 


CCAAAAAATT 


AGATTAAATA 


400 


AAATCAAGAA 


TTATCTCAAA 


GATTTTGATT 


ATGGAAATCA 


AGACTTCTCT 


450 


GGAGATAAAG 


AAAGAAACAA 


CGGATTAACA 


GAAGCATGGC 


TCGAAAGTAG 


500 


CTTAAAAATT 


TCACCAGAAG 


AACAAATTCA 


ATTCCTGCGT 


AAAATTATTA 


550 


ATCACAATCT 


CCCAGTTAAA 


AACTCAGCCA 


TAGAAAACAC 


CATAGAGAAC 


600 


ATGTATCTAC 


AAGATCTGGA 


TAATAGTACA 


AAACTGTATG 


GGAAAACTGG 


650 


TGCAGGATTC 


ACAGCAAATA 


GAACCTTACA 


AAACGGATGG 


TTTGAAGGGT 


700 


TTATTATAAG 


CAAATCAGGA 


CATAAATATG 


TTTTTGTGTC 


CGCACTTACA 


750 


GGAAACTTGG 


GGTCGAATTT 


AACATCAAGC 


ATAAAAGCCA 


AGAAAAATGC 


800 


GATCACCATT 


CTAAACACAC 


TAAATTTATA 


A 




831 



25. 



2) INFORMATION FOR SEQ ID NO: 1708 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



[ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
40 (B) STRAIN: HH22 

(C) ACCESSION NUMBER: M60253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1708 



45 TTGAAAAAGT TAATATTTTT AATTGTAATT 

TAATTCAAAC AGTTCACATG CCAAAGAGTT 

ATAATGCTCA TATTGGTGTT TATGCTTTAG 

GTAAAATTTA ATTCAGATAA GAGATTTGCC 

GATAAATAGT GCTATTTTGT TAGAACAAGT 

50 AAAAAGTACA TATTAACAAA GATGATATAG 

GAAAAATATG TAGGAAAAGA TATCACTTTA 

AATGACATAT AGTGATAATA CAGCAAACAA 

GTGGAATCAA AAAAGTTAAA CAACGTCTAA 

885 



GCTTTAGTTT 


TAAGTGCATG 


50 


AAATGATTTA 


GAAAAAAAAT 


100 


ATACTAAAAG 


TGGTAAGGAA 


150 


TATGCTTCAA 


CTTCAAAAGC 


200 


ACCTTATAAT 


AAGTTAAATA 


250 


TTGCTTATTC 


TCCTATTTTA 


300 


AAAGCACTTA 


TTGAGGCTTC 


350 


TAAAATTATA 


AAAGAAATCG 


400 


AAGAACTAGG 


AGATAAAGTA 


450 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACAAATCCAG 
CAAAAAAGAT 
TTATCGCAAA 
TTAATGTTAA 
AAAAGACTAT 
CTAGAAATGA 
GTTTTAGTCA 
TAAGTTGATA 



TTAGATATGA 
ACTTCAACAC 
TGGAAAATTA 
ATAATAAAAG 
AAGGTTGCTG 
TGTTGCTTTT 
TTTTTACGAA 
AGTGAAACCG 



GATAGAATTA 
CTGCTGCTTT 
AGCAAAGAAA 
CGGAGATACT 
ATAAAAGTGG 
GTTTATCCTA 
TAAAGACAAT 
CCAAGAGTGT 



AATTACTATT 
CGGTAAGACT 
ACAAAAAATT 
TTAATTAAAG 
TCAAGCAATA 
AGGGCCAATC 
AAAAGTGATA 
AATGAAGGAA 



CACCAAAGAG 
TTAAATAAAC 
CTTACTTGAT 
ACGGTGTTCC 
ACATATGCTT 
TGAACCTATT 
AGCCAAATGA 
TTTTAA 



500 
550 
600 
650 
700 
750 
800 
846 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO:. 1709 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: M29695 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1709 



ATGTCCGCGA 


GCACCCCCCC 


CATAACTCTT 


CGCCTCATGA 


CCGAGCGCGA 


50 


CCTGCCGATG 


CTCCATGACT 


GGCTCAACCG 


GCCGCACATC 


GTTGAGTGGT 


100 


GGGGTGGCGA 


CGAAGAGCGA 


CCGACTCTTG 


ATGAAGTGCT 


GGAACACTAC 


150 


CTGCCCAGAG 


CGATGGCGGA 


AGAGTCCGTA 


ACACCGTACA 


TCGCAATGCT 


200 


GGGCGAGGAA 


CCGATCGGCT 


ATGCTCAGTC 


GTACGTCGCG 


CTCGGAAGCG 


250 


GTGATGGCTG 


GTGGGAAGAT 


GAAACTGATC 


CAGGAGTGCG 


AGGAATAGAC 


300 


CAGTCTCTGG 


CTGACCCGAC 


ACAGTTGAAC 


AAAGGCCTAG 


GAACAAGGCT 


350 


TGTCCGCGCT 


CTCGTTGAAC 


TACTGTTCTC 


GGACCCCACC 


GTGACGAAGA 


400 


TTCAGACCGA 


CCCGACTCCG 


AACAACCATC 


GAGCCATACG 


CTC-CTATGAG 


450 


AAGGCAGGAT 


TCGTGCGGGA 


GAAGATCATC 


ACCACGCCTG 


ACGGGCCGGC 


500 


GGTTTACATG 


GTTCAAACAC 


GACAAGCCTT 


CGAGAGAAAG 


CGCGGTGTTG 


550 


CCTAA 










555 



2) INFORMATION FOR SEQ ID NO: 1710 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: K02 987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1710 



5 





ATGAACCAGA 


AAAACCCTAA 


AGACACGCAA 


AATTTTATTA 


CTTCTAAAAA 


50 




GCATGTAAAA 


GAAATATTGA 


ATCACACGAA 


TATCAGTAAA 


CAAGACAACG 


100 




TAATAGAAAT 


CGGATCAGGA 


AAAGGACATT 


TTACCAAAGA 


GCTAGTCAAA 


150 




ATGAGTCGAT 


CAGTTACTGC 


TATAGAAATT 


GATGGAGGCT 


TATGTCAAGT 


200 


10 


GACTAAAGAA 


GCGGTAAACC 


CCTCTGAGAA 


TATAAAAGTG 


ATTCAAACGG 


250 




ATATTCTAAA 


ATTTTCCTTC 


CCAAAACATA 


TAAACTATAA 


GATATATGGT 


300 




AATATTCCTT 


ATAACATCAG 


TACGGATATT 


GTCAAAAGAA 


TTACCTTTGA 


350 




AAGTCAGGCT 


AAA1 Al AoL 1 


A 1 L 1 1/41 Lyj 1 


Traps arrri 


ill vjl^o/i/iriri 


don 

H\J\J 




GATTGCAAAA 


TCTGCAACGA 


GCTTTGGGTT 


TACTATTAAT 


GGTGGAGATG 


450 


15 


GATATAAAAA 


TGCTCAAAAA 


AGTACCACCA 


CTATATTTTC 


ATCCTAAGCC 


500 




AAGTGTAGAC 


TCTGTATTGA 


TTGTTCTTGA 


a r* p Zi p zv t p a a 








CAAAGAAGGA 


CTACAAAAAG 


TATCGATCTT 


TTGTTTATAA 


GTGGGTAAAC 


600 




CGTGAATATC 


GTGTTCTTTT 


CACTAAAAAC 


CAATTCCGAC 


AGGCTTTGAA 


650 




GCATGCAAAT 


GTCACTAATA 


TTAATAAACT 


ATCGAAGGAA 


CAATTTCTTT 


700 


20 


CTATTTTCAA 


TAGTTACAAA 


TTGTTTCACT 


AA 




732 




2) INFORMATION FOR SEQ ID NO: 1711 








25 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 7 38 


bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS: Double 








30 


(D) 


TOPOLOGY: Linear 










(ii) MOLECULE TYPE: Genomic DNA 










(vi) ORIGINAL SOURCE: 










35 


(A) 


ORGANISM: Escherichia coli 








(B) 


STRAIN: BM2570 










(C) 


ACCESSION NUMBER: M19270 








(xi) SEQUENCE DESCRIPTION: SEQ ID 


NO: 1711 






40 


ATGAACAAAA 


ATATAAAATA 


TTCTCAAAAC 


TTTTTAACGA 


GTGAAAAAGT 


50 




ACTCAACCAA 


ATAATAAAAC 


AATTGAATTT 


AAAAGAAACC 


GATACCGTTT 


100 




ACGAAATTGG 


AACAGGTAAA 


GGGCATTTAA 


CGACGAAACT 


GGCTAAAATA 


150 




AGTAAACAGG 


TAACGTCTAT 


TGAATTAGAC 


AGTCATCTAT 


TCAACTTATC 


200 


45 


GTCAGAAAAA 


TTAAAATCGA 


ATACTCGTGT 


CACTTTAATT 


CACCAAGATA 


250 




TTCTACAGTT 


TCAATTCCCT 


AACAAACAGA 


GGTATAAAAT 


TGTTGGGAAT 


300 




ATTCCTTACC 


ATTTAAGCAC 


ACAAATTATT 


AAAAAAGTGG 


TTTTTGAAAG 


350 




CCATGCGTCT 


GACATCTATC 


TGATTGTTGA 


AGAAGGATTC 


TACAAGCGTA 


400 




CCTTGGATAT 


TCACCGAACA 


CTAGGGTTGC 


TCTTGCACAC 


TCAAGTCTCG 


450 


50 


ATTCAGCAAT 


TGCTTAAGCT 


GCCAGCGGAA 


TGCTTTCATC 


CTAAACCAAG 


500 




AGTAAACAGT 


GTCTTAATAA 


AACTTACCCG 


CCATACCACA 


GATGTTCCAG 


550 




ATAAATATTG 


GAAGCTATAT 


ACGTACTTTG 


TTTCAAAATG 


GGTCAATCGA 


600 




GAATATCGTC 


AACTGTTTAC 


TAAAAATCAG 


TTTCATCAAG 


CAATGAAACA 


650 



887 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



CGCCAAAGTA AACAATTTAA GTACCGTTAC TTATGAGCAA 
TTTTTAATAG TTATCTATTA TTTAACGGGA GGAAATAA 



PCT/CA00/01150 
GTATTGTCTA 



700 
738 



2) INFORMATION FOR SEQ ID NO: 1712 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 735 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



20 



25 



30 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN451 

(C) ACCESSION NUMBER: M17 990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1712 



ATGAACGAGA 
TAATATAGAT 
TCTTTGAAAT 
AGGTGTAATT 
TACAGAAAAT 
ATATATTGCA 
AATATACCTT 
TAGTATAGCT 
GATTATTAAA 
GATATTTCTA 
TAGAGTGAAT 
CACACAAAGA 
AAAGAATACA 
ACATGCAGGA 
CTCTTTTCAA 



AAAATATAAA 
AAAATAATGA 
CGGCTCAGGA 
TCGTAACTGC 
AAACTTGTTG 
GTTTAAATTT 
ATAACATAAG 
GATGAGATTT 
TACAAAACGC 
TATTAAGTAT 
AGCTCACTTA 
TAAACAGAAG 
AGAAAATATT 
ATTGACGATT 
TAGCTATAAA 



ACACAGTCAA 
CAAATATAAG 
AAAGGGCATT 
CATTGAAATA 
ATCACGATAA 
CCTAAAAACC 
TACGGATATA 
ATTTAATCGT 
TCATTGGCAT 
GGTTCCAAGA 
TC AG ATT AAA 
TATAATTATT 
TACAAAAAAT 
TAAACAATAT 
TTATTTAATA 



AACTTTATTA 
ATTAAATGAA 
TTACCCTTGA 
GACCATAAAT 
TTTCCAAGTT 
AATCCTATAA 
ATACGCAAAA 
GGAATACGGG 
TATTTTTAAT 
GAATATTTTC 
TAGAAAAAAA 
TCGTTATGAA 
CAATTTAACA 
TAGCTTTGAA 
AGTAA 



CTTCAAAACA 50 

CATGATAATA 100 

ATTAGTACAG 150 

TATGCAAAAC 200 

TTAAACAAGG 250 

AATATTTGGT 300 

TTGTTTTTGA 350 

TTTGCTAAAA 400 

GGCAGAAGTT 450 

ATCCTAAACC 500 

TCAAGAATAT 550 

ATGGGTTAAC 600 

ATTCCTTAAA 650 

CAATTCTTAT 700 
735 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1713 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1029 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

888 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(C) ACCESSION NUMBER: U00456 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1713 



5 


ATGAATAAAA 


TAAAAGTCGC 


AATTATCTTC 


GGCGGTTGCT 


CGGAGGAACA 


50 




TGATGTGTCG 


GTAAAATCCG 


CAATAGAAAT 


TGCTGCGAAC 


ATTAATACTG 


100 




AAAAATTCGA 


TCCGCACTAC 


ATCGGAATTA 


CAAAAAACGG 


CGTATGGAAG 


150 




CTATGCAAGA 


AGCCATGTAC 


GGAATGGGAA 


GCCGATAGTC 


TCCCCGCCAT 


200 




ATTCTCCCCG 


GATAGGAAAA 


CGCATGGTCT 


GCTTGTCATG 


AAAGAAAGAG 


250 


10 


AATACGAAAC 


TCGGCGTATT 


GACGTGGCTT 


TCCCGGTTTT 


GCATGGCAAA 


300 




TGCGGGGAGG 


ATGGTGCGAT 


ACAGGGTCTG 


TTTGAATTGT 


CTGGTATCCC 


350 




CTATGTAGGC 


TGCGATATTC 


AAAGCTCCGC 


AGCTTGCATG 


GACAAATCAC 


400 




TGGCCTACAT 


TCTTACAAAA 


AATGCGGGCA 


TCGCCGTCCC 


CGAATTTCAA 


450 




ATGATTGAAA 


AAGGTGACAA 


ACCGGAGGCG 


AGGACGCTTA 


CCTACCCTGT 


500 


15 


CTTTGTGAAG 


CCGGCACGGT 


CAGGTTCGTC 


CTTTGGCGTA 


ACCAAAGTAA 


550 




ACAGTACGGA 


AGAACTAAAC 


GCTGCGATAG 


AAGCAGCAGG 


' ACAATATGAT 


600 




GGAAAAATCT 


TAATTGAGCA 


AGCGATTTCG 


GGCTGTGAGG 


TCGGCTGCGC 


650 




GGTCATGGGA 


AACGAGGATG 


ATTTGATTGT 


CGGCGAAGTG 


GATCAAATCC 


700 




GGTTGAGCCA 


CGGTATCTTC 


CGCATCCATC 


AGGAAAACGA 


GCCGGAAAAA 


750 


20 


GGCTCAGAGA 


ATGCGATGAT 


TAT'CGTTCCA 


GCAGACATTC 


CGGTCGAGGA 


800 




ACGAAATCGG 


GTGCAAGAAA 


CGGCAAAGAA 


AGTATATCGG 


GTGCTTGGAT 


850 




GCAGAGGGCT 


TGCTCGTGTT 


GATCTTTTTT 


TGCAGGAGGA 


TGGCGGCATC 


900 




GTTCTAAACG 


AGGTCAATAC 


CCTGCCCGGT 


TTTACATCGT 


ACAGCCGCTA 


950 




TCCACGCATG 


GCGGCTGCCG 


CAGGAATCAC 


GCTTCCCGCA 


CTAATTGACA 


1000 


25 


GCCTGATTAC 


ATTGGCGATA 


GAGAGGTGA 






1029 



2) INFORMATION FOR SEQ ID NO: 1714 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1714 

45 GCACAGGTTC AGTTGGGCTT GATCTTGCTT TAGGTATAGG CGGTGTTCCA 50 

AAAGGAAGAA T TAT AG AAA T TTATGGRCCT GAAAGTTCAG GTAAAACCAC 100 

TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT GGRGTTTGTG 150 

CTTTTATCGA TGCAGAACAT GCRCTTGATG TRAAATATGC TAAAAATTTG 200 

GGTGTAAATA CAGATGATTT GTATGTTTCT CAGCCTGATT TTGGAGARCA 250 

50 AGCCTTAGAA ATTGTAGAAA CTATAGCAAG AAGTGGCGCA GTAGATCTTA 300 

TCGTAGTAGA TAGCGTTGCA GCGCTTACCC CAAAAGCAGA AATTGAAGGC 350 

GATATGGGTG ATCAACATGT AGGACTTCAA GCAAGACTTA TGTCTCAAGC 4 00 

TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC ACAGTAATTT 4 50 

889 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TYATCAACCA 
GAAACCACAA 
AGATGTTAGA 
ACCGCGTTAA 
5 CAAGCTGAAT 
ATTGATCGAT 
GGTTTTCTTA 
GCTTTCTTAA 



AATTCGTATG 
CAGGTGGAAA 
AAAGTAGCAA 
AGTAAAAGTA 
TTGATGTGAT 
TATGGTGTAA 
TAAAGATAAA 
AAGAAAAC 



AAAATCGGTG 
TGCATTAAAA 
CCTTAAAACA 
GTTAAAAATA 
GTTTGGAGAG 
AACTTGATAT 
AAACTTGGAC 



CTATGGGTTA 
TTTTATGCTT 
AAACC-AAGAA 
AAGTTGCTCC 
GGTTTAAGCC 
CGTAGATAAA 
AAGGTAGAGA 



TGGTACTCCT 
CTGTGCGTTT 
CCTATAGGAA 
TCCATTCAGA 
GTGAAGGTGA 
AGTGGTGCGT 
AAATTCAAAA 



500 
550 
600 
650 
700 
750 
800 
818 



10 



2) INFORMATION FOR SEQ ID NO: 1715 

. (i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRAN DE DNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(C) ACCESSION NUMBER: AF124224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1715 



TGGTGCTATC 

30 GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ACACTCCAGT 
TCATACRAAG 

35 TCCAACTCCA 
ACGTGTTCTC 
CGTGGACAAG 
AGAAACTTCA 
TAGACTACGC 

40. ACACGTGACA 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 

45 CAGTAGCGA 



TTAGTAGTAT 
CTTATTATCA 
AAGTTGACAT 
GTTCGTGACT 
TGTTGCAGGT 
AAAAAATCTT 
GAACGYGACG 
AATCACAGGT 
TTCGTGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCGAACG 
ACTAAATTCA 
TCATACTCCA 
CAGACATCAC 
CCTGGTGATA 



CTGCAGCTGA 
CGTCAAGTAG 
GGTTGACGAT 
TATTATCAGA 
TCTGCTTTAC 
AGAATTAATG 
TTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTT 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTTGT 
ACGTAACTAT 



TGGTCCAATG 
GTGTTCCTTA 
GAAGAATTAT 
ATACGATTTC 
GCGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTACATTATT 
CTTGCTAAAC 
TTACGTATTA 
ACTACCGTCC 
GTGTTACCAG 
GGAAGTTGAA 



CCTCAAACAC 


50 


CATCGTTGTA 


100 


TAGAATTAGT 


150 


CCAGGCGATG 


200 


AGGCGACGCT 


250 


ACGAATACAT 


300 


CCAGTTGAAG 


350 


TCGTGTTGAA 


400 


GTATTTCAGA 


450 


CGTAAATTGT 


500 


ACGTGGTGTT 


550 


CAGGAACAAT 


600 


ACTAAAGAAG 


650 


TCAATTCTAC 


700 


AAGGCGTTGA 


750 


TTAATTCACC 


800 




809 



2) INFORMATION FOR SEQ ID NO: 1716 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

890 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia defectiva 

(B) STRAIN: ATCC 4 9176 

(C) ACCESSION NUMBER: AF124225 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1716 



CGGCGCGATC 


CTCGTTGTAT 


CTGCTGCTGA 


CGGCCCAATG 


CCACAAACTC 


50 


GTGAACACAT 


CCTCTTGTCT 


CGTCAAGTTG 


GTGTTCCTTA 


CATCGTAGTA 


100 


TTCTTGAACA 


AAGTTGACAT 


GGTTGACGAC 


GAAGAATTGC 


TCGAATTAGT 


150 


TGAAATGGAA 


GTTCGTGACC 


TCTTGTCTGA 


ATACGACTTC 


CCAGGCGACG 


200 


ACACTCCAGT 


TATCGCTGGT 


TCAGCTTTGA 


AAGCTTTAGA 


AGGCGACGCT 


250 


AACTACGAAG 


CTAAAGTTTT 


AGAATTGATG 


GAACAAGTTG 


ATGCTTACAT 


300 


TCCAGAACCA 


GAACGTGACA 


CTGACAAGCC 


ATTCATGATG 


CCAGTCGAAG 


350 


ACGTATTCTC 


TATCACTGGT 


CGTGGTACTG 


TTGCAACTGG 


TCGTGTTGAA 


400 


CGTGGTCAAG 


TTCGCGTTGG 


TGACGAAGTT 


GAAATCGTTG 


GTATCGAAGA 


450 


AGAAACTTCT 


AAGACTACCG 


TTACCGGTGT 


TGAAATGTTC 


CGTAAGTTAT 


500 


TGGATTACGC 


TGAAGCTGGG 


GACAACGTTG 


GTACCTTGTT 


ACGTGGTGTA 


550 


ACTCGTGACC 


AAATCCAACG 


TGGTCAAGTA 


TTATCTAAAC 


CAGGTTCAAT 


600 


CACTCCGYAC 


ACTAAGTTCG 


AAGCTGAAGT 


GTACGTATTG 


TCTAAAGAAG 


650 


AAGGTGGTCG 


TCACACTCCA 


TTCTTCTCTA 


ACTACCGTCC 


ACAATTCTAC 


700 


TTCCGTACAA 


CTGACGTAAC 


TGGTGTTGTT 


ACTTTACCAG 


AAGGTACTGA 


750 


AATGGTTATG 


CCAGGCGACA 


ACGTACAAAT 


GGTTGTTGAA 


TTGATCCACC 


800 


CAATCGCGAT 


CGAAGAA 








817 



30 



2) INFORMATION FOR SEQ ID NO: 1717 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS:. Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium accolens 
45 (B) STRAIN: ATCC 49725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1717 

CGGCGCTATC CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC 50 

50 GCGAGCACGT TCTGCTTGCT CGCCAGGTTG GCGTTCCTTA CATCCTCGTT 100 

GCACTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

GGAGATGGAG ATCTCCGAGC TGCTCGCAGA GCAGGACTAC GATGAGGAAG . 200 

CTCCTATCGT TCACATCTCC GCTCTGAAGG CACTCGAGGG TGACGAGAAG 250 

891 



SUBSTITUTE SHEET (RULE 26) 
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PCT/CA00/01150 



15 



TGGGTACAGT 


CCATCGTTGA 


CCTGATGGAT 


GCCTGCGACA 


ACTCCATCCC 


300 


TGATCCGGAG 


CGCGCTACCG 


ATCAGCCGTT 


CTTGATGCCT 


ATCGAGGACA 


350 


TCTTCACCAT 


TACCGGCCGC 


GGTACCGTTG 


TTACCGGCCG 


TGTTGAGCGT 


400 


GGTCGTCTGA 


ACGTCAACGA 


GGACGTTGAG 


ATCATCGGTA 


TCCAGGAGAA 


450 


GTCCCAGAAC 


ACCACCGTTA 


CCGGTATCGA 


GATGTTCCGC 


AAGATGATGG 


500 


ACTACACCGA 


GGCTGGCGAC 


AACTGTGGTC 


TGCTTCTGCG 


TGGTACCAAG 


550 


CGTGAGGACG 


TTGAGCGTGG 


CCAGGTTGTT 


ATCAAGCCGG 


GCGCTTACAC 


600 


CCCTCACACC 


AAGTTCGAGG 


GTTCCGTCTA 


CGTCCTGAAG 


AAGGAAGAGG 


650 


GCGGCCGCCA 


CACCCCGYTC 


ATGAACAACT 


ACCGTCCTCA 


GTTCTACTTC 


700 


CGCACCACCG 


ACGTTACCGG 


TGTTGTGAAC 


CTGCCTGAGG 


GCACCGAGAT 


750 


GGTTATGCCT 


GGCGACAACG 


TTGAGATGTC 


TGTTGAGCTC 


ATCCAGCCTG 


800 


TTGCTATGGA 


CGAG 








814 



2) INFORMATION FOR SEQ ID NO: 1718 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium genitalium 

(B) STRAIN: ATCC 33031 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1718 



50 



CGGCGCCATC 


CTGGTTGTTG 


CTGCAACCGA 


TGGCCCGATG 


CCGCAGACCC 


50 


GTGAGCACGT 


TCTGCTGGCT 


CGCCAGGTTG 


GCGTTCCGTA 


CATCCTAGTT 


100 


GCACTGAACA 


AGTGCGACAT 


GGTTGATGAT 


GAGGAGCTGC 


TGGAGCTCGT 


150 


CGAGATGGAG 


GTCCGCGAGC 


TGCTGGCTGA 


GCAGGACTTC 


GACGAGGAAG 


200 


CACCTGTTGT 


TCACATCTCC 


GCACTGAAGG 


CCCTGGAGGG 


CGACGAGAAG 


250 


TGGGCTAAGC 


AGATCCTGGA 


GCTCATGGAG 


GCTTGCGACA 


ACTCCATCCC 


300 


GGATCCGGAG 


CGCGAGACCG 


ACAAGCCGTT 


CCTGATGCCG 


GTTGRGGACA 


350 


TCTTCACCAT 


TACCGGCCGC 


GGTACCGTTG 


TTACCGGCCG 


TGTTGAGCGT 


400 


GGCGTCCTGA 


ACCTGAACGA 


CGAGGTCGAG 


ATCCTGGGCA 


TCCGCGAGAA 


450 


GTCCACCAAG 


ACCACCGTTA 


CCTCCATCGA 


GATGTTCAAC 


AAGCTGCTGG 


500 


ACACCGCAGA 


GGCTGGCGAC 


AACGCCGCAC 


TGCTGCTGCG 


TGGCCTGAAG 


550 


CGCGAAGATG 


TTGAGCGTGG 


TCAGATCGTT 


GCTAAGCCGG 


GCGAGTACAC 


600 


CCCGCACACC 


GAGTTCGAGG 


GCTCCGTCTA 


CGTTCTGTCC 


AAGGACGAGG 


650 


GTGGCCGCCA 


CACCCCGTTC 


TTCGACAACT 


ACCGTCCGCA 


GTTCTATTTC 


700 


CGCACCACCG 


ACGTTACCGG 


TGTTGTGAAG 


CTGCCGGAGG 


GCACCGAGAT 


750 


GGTTATGCCG 


GGCGACAACG 


TTGACATGTC 


CGTCACCCTG 


ATCCAGCCGG 


800 


TTGCTATGGA 


CGAG 








814 



2) INFORMATION FOR SEQ ID NO: 1719 



892 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Corynebacteri urn jeikeium 

(B) STRAIN: ATCC 43216 



(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1719 



15 


CGGCGCCATC 


CTGGTTGTTG 


CCGCAACCGA 


TGGCCCGATG 


CCGCAGACCC 


50 




GCGAGCACGT 


TCTGCTGGCY 


CGCCAGGTTG 


GCGTTCCGTA 


CATCCTGGTT 


100 




GCACTGAACA 


AGTGTGACAT 


GGTTGACGAT 


GAGGAGCTGC 


TGGAGCTCGT 


150 




CGAGATGGAG 


GTCCGCGAGC 


TGCTGGCTGA 


GCAGGACTTC 


GACGAGGAAG 


200 




CTCCGGTTGT 


TCACATCTCC 


GCACTGAAGG 


CCCTGGAGGG 


CGACGAGAAG 


250 


20 


TGGGCTAACC 


AGATTCTCGA 


GCTGATGCAG 


GCTTGCGACG 


AGTCTATCCC 


300 




GGATCCGGAG 


CGCGAGACCG 


ACAAGCCGTT 


CCTGATGCCG 


GTTGWGGACA 


350 




TCTTCACCAT 


TACCGGTCGC 


GGTACCGTTG 


TTACCGGCCG 


TGTTGAGCGT 


400 




GGCATCCTGA 


ACCTGAACGA 


CGAGGTTGAG 


ATCCTGGGTA 


TCCGCGAGAA 


450 




GTCCCAGAAG 


ACCACCGTTA 


CCTCCATCGA 


GATGTTCAAC 


AAGCTGCTGG 


500 


25 


ACACCGCAGA 


GGCTGGCRAC 


AACGCTGCAC 


TGCTGCTGCG 


TGGTCTGAAG 


550 




CGCGAGGACG 


TTGAGCGTGG 


CCAGATCATC 


GCTAAGCCGG 


GCGAGTACAC 


600 




CCCGCACACC 


GAGTTCGAGG 


GCTCCGTCTA 


CGTTCTGTCC 


AAGGACGAGG 


650 




GCGGCCGCCA 


CACCCCGTTC 


TTCGACAACT 


ACCGTCCGCA 


GTTCTACTTC 


700 




CGCACCACCG 


ACGTTACCGG 


TGTTGTGAAG 


CTGCCTGAGG 


GCACCGAGAT 


750 


30 


GGTTATGCCG 


GGCGACAACG 


TYGACATGTC 


CGTCACCCTG 


ATCCAGCCGG 


800 




TTGCTATGGA 


CGAG 








814 



35 2) INFORMATION FOR SEQ ID NO: 1720 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1720 



CGGCGCTATC TTGGTTGTTG 
GCGAGCACGT TCTGCTGGCT 
GCACTAAACA AGTGCGACAT 



CAGCTACCGA CGGCCCAATG 

CGCCAGGTTG GCGTTCCTTA 

GGTTGACGAC GAGGAAATCC 

893 



CCACAGACTC 50 
CATCCTGGTT 100 
TCGAGCTCGT 150 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CGAGATGGAG 


ATCCGCGAAT 


TGCTGGCTGA 


CCAGGAATTC 


GACGAAGAAG 


200 


CTCCAATCGT 


TCACATCTCC 


GCAGTCGGCG 


CCTTGGAAGG 


CGAAGAGAGG 


250 


TGGGTTAACG 


CCATCGTTGA 


ACTGATGGAT 


GCTTGTGACG 


AGTCGATCCC 


300 


TGATCCAGAC 


CGTGCTACCG 


ACAAGCCATT 


CCTGATGCCT 


ATCGAGGACA 


350 


TCTTCACCAT 


TACCGGTCGT 


GGCACCGTTG 


TTACGGGTCG 


TGTTGAGCGT 


400 


GGT'TCCCTGA 


AGGTCAACGA 


AGAAGTCGAG 


ATCATCGGCA 


TCAAGGAAAA 


450 


GTCCCAGAAG 


ACCACCATCA 


CCGGTATCGA 


AATGTTCCGC 


AAGATGCTGG 


500 


ACTACACCGA 


GGCCGGCGAC 


AACGCTGGTC 


TGCTGCTTCG 


CGGTACCAAG 


550 


CGTGAAGACG 


TTGAGCGTGG 


ACAGGTTATC 


GTTGCTCCAG 


GTGCTTACAG 


600 


CACCCACAAG 


AAGTTCGAAG 


GTTCCGTCTA 


CGTTCTTTCC 


AAGGACGAGG 


650 


GCGGCCGCCA 


CACCCCGTTC 


TTCGACAACT 


ACCGTCCTCA 


GTTCTACTTC 


700 


CGCACCACCG 


ACGTTACCGG 


TGTTGTTACC 


CTGCCTGAGG 


GCACCGAG 


748 



15 



2) INFORMATION FOR SEQ ID NO: 1721 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium striatum 

(B) STRAIN: ATCC 6940 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1721 



35 



40 



45 



GGCGCTATCT 


TGGTTGTTGC 


TGCAACCGAT 


GGCCCGRTGC 


CGCAGACCCG 


50 


CGAGCACGTT 


CTTCTGGCTC 


GCCAGGTTGG 


CGTTCCTTAC 


ATCCTCGTTG 


100 


CACTGAACAA 


GTGCGACATG 


GTTGACGACG 


AGGAAATTAT 


CGAGCTCGTC 


150 


GAGATGGAGA 


TCCGCGAACT 


GCTCGCAGAG 


CAGGACTACG 


ATGAGGAAGC 


200 


TCCGATCGTT 


CACATCTCTG 


CTCTGAAGGC 


TCTTGAGGGC 


GRCGAGAAGT 


250 


GGGTACAGGC 


TATCGTTGAC 


CTGATGCAGG 


CTTGCGATGA 


CTCCATCCCG 


300 


GATCCGGAGC 


GCGAGCTGGA 


CAAGCCGTTC 


CTGATGCCAA 


TCGAGGACAT 


350 


CTTCACCATC 


ACCGGCCGCG 


GTACCGTTGT 


TACTGGCCGT 


GTTGAGCGTG 


400 


GCTCCCTGAA 


CGTCAACGAG 


GACGTTGAGA 


TCATCGGTAT 


CCAGGACARG 


450 


TCCATCTCCA 


CCACCGTTAC 


CGGTATCGAG 


ATGYTCCGCA 


AGATGATGGA 


500 


CTACACCGAG 


GCTGGCGACA 


ACTGTGGTCT 


GCTTCTGCGT 


GGTACCAAGC 


550 


GTGAAGAGGT 


TGAGCGCGGC 


CAGGTTGTTA 


TTAAGCCGGG 


CGCTTACACC 


600 


CCTCACACCC 


AGTTCGAGGG 


TTCCGTCTAC 


GTCCTGAAGA 


AGGAAGAGGG 


650 


CGGCCGCCAC 


ACCCCGTTCA 


TGGACAACTA 


CCGTCCGCAG 


TTCTACTTCC 


700 


GCACCACCGA 


CGTTACCGGC 


GTCATCAAGC 


TGCCTGAGGG 


CACCGAGATG 


750 


GTTATGCCTG 


GCGACAACGT 


CGAGATGTCY 


GTCGAGCTGA 


TCCAGCCGGT 


800 


CGCTATGGAC 


GAG 








813 



50 



2) INFORMATION FOR SEQ ID NO: 1722 



894 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 



15 



20 



25 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(C) ACCESSION NUMBER: AF124220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1722 



CGGAGCTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACACTCCAGT 
TCATACGAAG 
CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
CGAAACTGCT 
TAGACTACGC 
GCACGTGAAG 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTWATG 
CAATYGCGGT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGACT 
TATCGCAGGT 
AAAAAATCTT 
GTTCGTGATA 
AATCACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGT 
ATATCCAACG 
ACAAAATTCT 
TCATACTCCA 
CTGACGTAAC 
CCTGGGGATA 
AGAAGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCGTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTAGTT 
ACGTAACTAT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGACTTC 
AAGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCTTTGTT 
TTGGCTAAAC 
TTATGTTCTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 



CCTCAAACTC 50 

CATCGTTGTA 100 

TTGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACGCT 250 

ACGAATATAT 300 

CCAGTCGAAG 350 

TCGTGTTGAA 400 

GTATCGCTGA 450 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 750 

TTGATYCACC 800 
817 



35 



2) INFORMATION FOR SEQ ID NO: 1723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerelia vaginalis 

(B) STRAIN: ATCC 14018 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1723 



TGGCGCAATC CTCGTGGTTG CTGCTACCGA CGGTCCAATG GCTCAGACCC 50 
GTGAACACGT CTTGCTTGCT AAGCAGGTCG GCGTTCCAAA AATTCTTGTT 100 

895 
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PCT7CA00/01150 






GCTTTGAACA 


AGTGCGATAT 


GGTTGACGAC 


Tv TV /^Tv/^prprTiTi 

GAAGAGL 1 1 A 


lCGATLILbi 


1 Oil 




TGAAGAAGAG 


GTCCGTGACC 


TCCTCGAAGA 


AAALbbLl 1L 


GATCGLbAl 1 


^ uu 




GCCCAGTCYT 


CCGTACTTCC 


GCTTACGGCG 


LTTlbLAlbA 


IGACGLILLA 


^ ou 




/*"* TV O /"* T\ O TV /""" TV 

GACCACGACA 


AGTGGGTAGA 


GALLGTCAAG 


bAAL 1 LA 1 bA 


AbbL I b 1 I b/i 




5 


LbAGTALATC 


CCAACCLLAA 


L 1 LALbAi LI 


1 bAL AAbLLA 


1 i Li 1 br\l bl 






/""« TV T\ rp /->/-> 7\ TV Tv 

CAATCGAAGA 


TGTGTTCALL 


AILTCLbbTL 


bi bb 1 TLLbl 


IbiLALLotj J. 






CGTGTTGAGC 


GTGGTAAGCT 


CCCAATCAAC 


ACLLLAbl 1 b 


7\/^Tvrp/^/-^rprn/^/-« 

AbAI Lbl lbb 


>1 a 




TTTGCGCGAT 


ACCCAGACCA 


CCACCGTCAC 


/^rn/*«rnTv rpr"" f T\ 

CTCTATCGAG 


Tv /~ , ^ ,r r k T l /^'/^ s T\ ^« TV 

ACCTTLLALA 


ouu 




TV /-l TV /"> 7\ m /*■ TV 

AGCAGATGGA 


TGAGGCAGAG 


GCTGGCGATA 


tv t\ r , TT , r' r nr ,| T' 

ACACTGGILT 


lCTTClLLbL 


DDI) 


10 


GGTATCAACC 


GTACCGACGT 


TGAGCGTGGT 


CAGGTTGTGG 


LIbL 1 LLAbb 


dUU 




TTCTGTGACT 


TV TV /"» TV Z""' /** TV 

CCACACACCA 


7\ /"• rp m /*"* /"» TV TV 

AGTTCGAAGG 


CGAAGTTTAC 


GTLTTGALLA 


DDL) 




AGGACGAAGG 


TGGCCGTCAC 


1CGCCATTCT 


IClCLAAOi A 


LLb 1 LLALAb 


/ uu 




TTCTACTTCC 


GTACCACCGA 


TGTTACTGGC 


GTTATCACCT 


TGCCAGACGG 


750 




CATCGAAATG 


GTTCAGCCAG 


GCGATCACGC 


AACCTTCACT 


GTTGAGTTGA 


800 


15 


TCCAGGCTAT 


CGCAATGGAA 


GAG 






823 


/-> r\ 


2) INFORMATION FOR SEQ ID NO: 1724 








20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 817 


bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS: Double 








25 


(D) 


TOPOLOGY: Linear 










(ii) MOLECULE TYPE: Genomic DNA 










(vi) ORIGINAL SOURCE: 










30 


(A) 


ORGANISM: Listeria innocua 








(B) 


STRAIN: ATCC 33090 










(xi) SEQUENCE DESCRIPTION: SEQ ID 


NO: 1724 






35 


CGGAGCTATC 


TTAGTAGTAT 


CTGCTGCTGA 


TGGCCCAATG 


PPTiPTt T\ 7\pfpp 

CCACAAALTC 


c. n 
DU 




GTGAACATAT 


CTTACTTTCA 


CGTCAAGTTG 


GTGTTCCATA 


PT, m/-^/^«rprp/-'rpTv 

CATCGTTGTA 


inn 
1UU 




TTCATGAACA 


AATGTGACAT 


GGTTGACGAT 


GAAGAATTAC 


rp tv /"TV tv rp rp tv rp 

1 AGAA1 I Abl 


lou 




TGAAATGGAA 


ATTCGTGATC 


TATTAACTGA 


ATATGAATTC 


PPTipPPr7\ rp <^ 

CCTGGCGA1G 


0 n n 




ACATTCCTGT 


AATCAAAGGT 


TCAGCTCTTA 


AAGCACTTCA 


Tv z^"* /"^ rp /*> T\ tv pprp 

AGGTGAAGCT 


^oU 


40 


GACTGGGAAG 


CTAAAATTGA 


CGAGTTAATG 


GAAGCTGTAG 


ATTCTTACAT 


JUU 




TCCAACTCCA 


GAACGTGATA 


CTGACAAACC 


ATTCATGATG 


/~* /~y tv /-» rp rn r*^ TV 

CCAGTTGAGG 


0 c. r» 
JOU 




ATGTATTCTC 


AATCACTGGT 


CGTGGAACAG 


TTGCAACTGG 


Tv / r "*/~ , rp/Tnrp/""" > TV TV 

ACGTGTTGAA 


4 UU 




CGTGGACAAG 


TTAAAGTTGG 


TGACGAAGTA 


GAAGTTATCG 


/"TP TV rp rTl TV TV f~* TV 

GTATTGAAGA 


4 DU 




AGAAAGCAAA 


AAAGTAGTAG 


TAACTGGAGT 


AGAAATGTTC 


/~« /— ' rp TV TV TV rp rp tv 

CGTAAAI I AL 


ouu 


45 


TAGACTACGC 


TGAAGCTGGC 


GACAACATTG 


GCGCACTTCT 


ACG rGbJ. b i I 


OOU 




GCTCGTGAAG 


ATATCCAACG 


TGGTCAAGTA 


TTAGCTAAAC 


LAbbl 1 LbAl 


ouu 




TACTCCACAC 


ACTAACTTCA 


AAGCTGAAAC 


TTATGTTTTA 


TiPfpTi TV TV Tv TV 

AL 1 AAAbAAb 






AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGCCC 


7\ /-^ tv t\ rp rp 0 rp tv rp 

ACAAT1 LI Al 


7 no. 
/ UU 




TTCCGTACTA 


CTGACGTAAC 


TGGTATTGTT 


ACACTTCCAG 


tv Tv m t\ /■> rn ^» tv 

AAGGTACTGA 


T C n 

750 


50 


AATGGTAATG 


CCTGGTGATA 


ACATTGAGCT 


TGCAGTTGAA 


CTAATTGCAC 


800 




CAATCGCTAT 


CGAAGAC 








817 



896 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1725 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1725 



CGGAGCTATC 
GTGAACATAT 
ATTCATGAAC 
TTGAAATGGA 
GACATTCCTG 
TGATTGGGAA 
TTCCAACTCC 
GATGTATTCT 
ACGTGGACAA 
AAGAAAGCAA 
CTAGACTACG 
TGCTCGTGAA 
TTACTCCACA 
GAAGGTGGAC 
TTTCCGTACT 
AAATGGTAAT 
CCAATCGCTA 



TTAGTAGTAT 
TCTTACTTTC 
AAATGTGACA 
AATTCGTGAT 
TAATCAAAGG 
GCTAAAATTG 
AGAACGTGAT 
CAATCACTGG 
GTTAAAGTTG 
AAAAGTAGTA 
CTGAAGCTGG 
GATATCCAAC 
TACTAACTTC 
GTCATACTCC 
ACTGACGTAA 
GCCTGGTGAT 
TCGAAGAC 



CTGCTGCTGA 
ACGTCAAGTT 
TGGTTGACGA 
CTATTAACTG 
TTCAGCTCTT 
ACGAGTTAAT 
ACTGACAAAC 
TCGTGGAACA 
GTGACGAAGT 
GTAACTGGAG 
CGACAACATT 
GTGGTCAAGT 
AAAGCTGAAA 
ATTCTTCAAC 
CTGGTATTGT 
AACATTGAGC 



TGGTCCAATG 
GGTGTTCCAT 
TGAAGAATTA 
AATATGAATT 
AAAGCACTTC 
GGAAGCTGTA 
CATTCATGAT 
GTTGCAACTG 
AGAAGTTATC 
TAGAAATGTT 
GGCGCACTTC 
ATTAGCTAAA 
CTTATGTTTT 
AACTACCGCC 
TACACTTCCA 
TTGCAGTTGA 



CCACAAACTC 
ACATCGTTGT 
CTTGAATTAG 
CCCTGGCGAC 
AAGGTGAAGC 
GATTCTTACA 
GCCAGTTGAG 
GACGTGTTGA 
GGTATTGAAG 
CCGTAAATTA 
TACGTGGTGT 
CCAGGTTCGA 
AACTAAAGAA 
CACAATTCTA 
GAAGGTACTG 
ACTAATTGCA 



2) INFORMATION FOR SEQ ID NO: 1726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: LSPQ 5093202 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1726 
CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 

897 

SUBSTITUTE SHEET (RULE 26) 
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GTGAACATAT 


CTTACTTTCA 


CGTCAAGTTG 


GTGTTCCATA 


CATCGTTGTA 


100 


TTCATGAACA 


AATGTGACAT 


GGTTGACGAT 


GAAGAATTAC 


TAGAATTAGT 


150 


TGAAATGGAA 


ATTCGTGATC 


TATTAACTGA 


ATATGAATTC 


CCTGGCGATG 


200 


ACATTCCTGT 


AATCAAAGGT 


TCAGCTCTTA 


AAGCACTTCA 


AGGTGAAGCT 


250 


GACTGGGAAG 


CTAAAATTGA 


CGAGTTAATG 


GAAGCTGTAG 


ATTCTTACAT 


300 


TCCAACTCCW 


GAACGTGATA 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 


ATGTATTCTC 


AATCACTGGT 


CGTGGAACAG 


TTGCAACTGG 


ACGTGTTGAA 


400 


CGTGGACAAG 


TTAAAGTTGG 


TGACGAAGTA 


GAAGTTATCG 


GTATCGAAGA 


450 


AGAAAGCAAA 


AAAGTAGTAG 


TAACTGGAGT 


AGAAATGTTC 


CGTAAATTAC 


500 


TAGACTACGC 


TGAAGCTGGC 


GACAACATTG 


GCGCACTTCT 


ACGTGGTGTT 


550 


GCTCGTGAAG 


ATATCCAACR 


TGGTCAAGTA 


TTAGCTAAAC 


CAGGTTCGAT 


600 


TACTCCACAC 


ACTAACTTCA 


AAGCTGAAAC 


TTATGTTTTA 


ACTAAAGAAG 


650 


AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGCCC 


ACAATTCTAT 


700 


TTCCGTACTA 


CTGACGTAAC 


TGGTATTGTT 


ACACTTCCAG 


AAGGTACTGA 


750 


AATGGTAAYG 


CCTGGTGATA 


ACATTGAGCT 


TGCAGTTGAA 


CTAATTGCAC 


800 


CAATCGCTAT 


CGAAGAC . 








817 



20 2) INFORMATION FOR SEQ ID NO: 1727 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 (vi) ORIGINAL SOURCE: 

(A) 1 ORGANISM: Listeria seeligeri 

(B) STRAIN: ATCC 35967 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1727 



CGGAGCTATC TTAGTAGTAT 

GTGAACATAT CTTACTTTCA 

TTCATGAACA AATGTGACAT 

TGAAATGGAA ATTCGTGATC 

ACATTCCTGT AATCAAAGGT 

GACTGGGAAG CTAAAATTGA 

TCCAACTCCA GAACGTGATA 

ATGTATTCTC AATCACTGGT 

CGTGGACAAG TTAAAGTTGG 

AGAAAGCAAA AAAGTAATAG 

TAGACTACGC TGAAGCTGGC 

GCTCGTGAAG ATATCCAACG 

TACTCCACAT ACTAACTTCA 

AAGGTGGACG TCACACTCCA 

TTCCGTACTA CTGACGTAAC 

AATGGTAATG CCTGGTGATA 

CAATCGCTAT CGAAGAC 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACTG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



TGGCCCAATG 
GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CCACAAACTC 50 

CATCGTTGTA 100 

TTGAATTAGT 150 

CCTGGTGATG 200 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 400 

GTATTGAAGA 450 

CGTAAATTAC 500 

ACGTGGTGTT 550 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1728 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 25923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1728 



CGGTGGTATC 
GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ATGTACCTGT 
CAATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
CACATCTAAA 
ACTACGCTGA 
CGTGAAGACG 
ACCACATACT 
GTGGACGTCA 
CGTACTACTG 
GGTAATGCCT 
TCGCGATTGA 



TTAGTAGTAT 
TCTTTTATCA 
AAGTTGACAT 
GTTCGTGACT 
AATCGCTGGT 
AAAAAATCTT 
GAACGTGATT 
AATCACTGGT 
TCAAAGTTGG 
ACAACTGTTA 
AGCTGGTGAC 
TACAACGTGG 
GAATTCAAAG 
CACTCCATTC 
ACGTAACTGG 
GGTGATAACG 
AGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAAGCGA 
TCAGCATTAR 
AGAATTARTG 
CTGACAAACC 
CGTGGTACTG 
TGAAGAAGTT 
CAGGTGTTGA 
AACATTGGTG 
TCAAGTATTA 
CAGAAGTATA 
TTCTCAAACT 
TGTTGTTCAC 
TTGAAATGAC 



CGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATATGACTTC 
AAGCTTTAGA 
GAAGCTGTAG 
ATTCATGATG 
TTGCTACAGG 
GAAATCATCG 
AATGTTCCGT 
CATTATTACG 
GCTGCTCCTG 
CGTATTATCA 
ATCGTCCACA 
TTACCAGAAG 
AGTAGAATTA 



CCACAAACTC 
ATTAGTAGTA 
TAGAATTAGT 
CCAGGTGACG 
AGGCGATGCT 
ATACTTACAT 
CCAGTTGAGG 
CCGTGTTGAA 
GTTTACATGA 
AAATTATTAG 
TGGTGTTGCT 
GTTCAATTAC 
AAAGACGAAG 
ATTCTATTTC 
GTACTGAAAT 
ATCGCTCCAA 



2) INFORMATION FOR SEQ ID NO: 1729 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1729 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 
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GTGAACACAT 


TCTTTTATCA 


CGTRACGTTG 


GTGYTCCAGC 


ATTAGTTGTA 


100 


TTCTTAAACA 


AAGTTGACAT 


GGTTGACGAY 


GAAGAATTAT 


TAGAATTRGT 


150 


AGAAATGGAA 


GTTCGTGRCT 


TATTAAGCGA 


ATATGACTTC 


CCAGGTGACG 


200 


ATGTACCTGT 


AATCTCTGGT 


TCTGCATTAA 


AAGCTTTAGA 


AGGCGACGCT 


250 


GACTATGAGC 


AAAAAATCTT 


AGACTTAATG 


CAAGCTGTTG 


ATGACTYCAT 


300 


TCCAACACCA 


GAACGTGATT 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 


ACGTATTCTC 


AATCACTGGT 


CGTGGTACTG 


TTGCTACAGG 


CCGTGTTGAA 


400 


CGTGGTCAAA 


TCAAAGTCGG 


TGAAGAAATC 


GARATCATCG 


GTATGCAAGA 


450 


AGAATCAAGC 


AAAACAACTG 


TTACTGGTGT 


AGAAATGTTC 


CGTAAATTAT 


500 


TAGACTACGC 


TGAAGCTGGT 


GACAACATTG 


GTGCATTATT 


ACGTGGTGTT 


550 


TCACGTGATG 


ATGTACAACG 


TGGTCAAGTT 


TTAGCTGCTC 


CTGGTACTAT 


600 


CACACCACAT 


ACAAAATTCA 


AAGCGGATGT 


TTACGTTTTA 


TCTAAAGATG 


650 


AAGGTGGTCG 


TCATACGCCA 


TTCTTCACTA 


ACTACCGCCC 


ACAATTCTAT 


700 


TTCCGTACTA 


CTGACGTAAC 


TGGTGTTGTT 


AACTTACCAG 


AAGGTACTGA 


750 


AATGGTTATG 


CCTGGCGATA 


ACGTTGAAAT 


GGATGTTGAA 


TTAATTTCTC 


800 


CAATCGCTAT 


TGAAGAC ■ 








817 



20 2) INFORMATION FOR SEQ ID NO: 1730 



25 



35 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1730 



CGGCGGTATC TTAGTAGTAT 

GTGAACACAT CTTATTATCA 

TTCTTAAACA AAGCTGACAT 

TGAAATGGAA GTTCGTGACT 

ATGTACCAGT TATCGTTGGT 

GAATACGAAC AAAAAATCTT 

CCCAACTCCA GAACGTGACT 

ACGTATTCTC AATCACTGGT 

CGTGGTCAAA TCAAAGTCGG 

AGAAAGCAAG AAAACAACAG 

TAGACTACGC TGAAGCTGGT 

GCACGTGAAG ACGTACAACG 

TACTCCACAC ACAAAATTCA 

AAGGTGGACG TCATACTCCA 

TTCCGTACTA CTGACGTAAC 

AATGGTTATG CCTGGCGATA 

CAATCGCGAT TGAAGAC 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAGAAAT 



TGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 50 

TTTAGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGACG 200 

AGGCGACCCA 250 

ATGACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 400 

GTATCACTGA 450 

CGTAAATTAT 500 

ACGTGGTGTT 550 

CTGGCTCTAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 750 

TTGATCGCTC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1731 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(C) ACCESSION NUMBER: 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1731 



CGGAGCTATC 


CTTGTAGTTG 


CTTCAACTGA 


TGGACCAATG 


CCACAAACTC 


50 


GTGAGCACAT 


CCTTCTTTCA 


CGTCAAGTTG 


GTGTTAAACA 


CCTTATCGTA 


100 


TTCATGAACA 


AAGTTGACCT 


TGTTGATGAT 


GAAGAATTGC 


TTGAATTGGT 


150 


TGAAATGGAA 


ATTCGTGACC 


TTCTTTCAGA 


ATACGACTTC 


CCAGGTGATG 


200 


ACCTTCCAGT 


TATCCAAGGT 


TCAGCTCTTA 


AAGCACTTGA 


AGGCGACGAA 


250 


AAATACGAAG 


ACATCATCAT 


GGAATTGATG 


AGCACTGTTG 


ATGAGTACAT 


300 


TCCAGAACCA 


GAACGTGATA 


CTGACAAACC 


TTTACTTCTT 


CCAGTTGAAG 


350 


ATGTATTCTC 


AATCACTGGA 


CGTGGTACAG 


TTGCTTCAGG 


ACGTATCGAC 


400 


CGTGGTACTG 


TTCGTGTCAA 


CGACGAAGTT 


GAAATCGTTG 


GTATTAAAGA 


450 


AGATATCCAA 


AAAGCAGTTG 


TTACTGGTGT 


TGAAATGTTC 


CGTAAACAAC 


500 


TTGACGAAGG 


TCTTGCAGGG 


GACAACGTTG 


GTGTTCTTCT 


TCGTGGTGTT 


550 


CAACGTGATG 


AAATCGAACG 


TGGTCAAGTT 


CTTGCTAAAC 


CAGGTTCAAT 


600 


CAACCCACAC 


ACTAAATTTA 


AAGGTGAAGT 


TTACATCCTT 


TCTAAAGAAG 


650 


AAGGTGGACG 


TCATACTCCA 


TTCTTCAACA 


ACTACCGTCC 


ACAATTCTAC 


700 


TTCCGTACAA 


CTGACGTAAC 


AGGTTCAATC 


GAACTTCCAG 


CAGGAACAGA 


750 


AATGGTTATG 


CCTGGTGATA 


ACGTTACTAT 


CGAAGTTGAA 


TTGATTCACC 


800 


CAATCGCCGT 


AGAACAA 








817 



35 



2) INFORMATION FOR SEQ ID NO: 1732 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: ATCC 27336 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1732 
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CGGAGCTATC 


CTTGTAGTAG 


CTTCAACTGA 


CGGACCAATG 


CCACAAACTC 


50 


GTGAGCACAT 


CCTTCTTTCA 


CGTCAGGTTG 


GTGTTAAACA 


CCTTATCGTC 


100 


TTCATGAACA 


AAGTTGACTT 


GGTTGACGAC 


GAAGAATTGC 


TTGAATTGGT 


150 


TGAAATGGAA 


ATCCGTGACC 


TATTGTCAGA 


ATACGACTTC 


CCAGGTGACG 


200 


ATCTTCCAGT 


TATCCAAGGT 


TCAGCACTTA 


AAGCTCTTGA 


AGGTGACTCT 


250 


AAATACGAAG 


ACATCGTTAT 


GGAATTGATG 


AACACAGTTG 


ATGAGTATAT 


300 


CCCAGAACCA 


GAACGTGACA 


CTGACAAACC 


ATTGCTTCTT 


CCAGTCGAGG 


350 


ACGTATTCTC 


AATCACTGGA 


CGTGGTACAG 


TTGCTTCAGG 


ACGTATCGAC 


400 


CGTGGTATCG 


TTAAAGTCAA 


CGACGAAATC 


GAAATCGTTG 


GTATCAAAGA 


450 


AGAAACTCRA 


AAAGCAGTTG 


TTACTGGTGT 


TGAAATGTTC 


CGTAAACAAC 


500 


TTGACGAAGG 


TCTTGCTGGA 


GATAACGTAG 


GTGTCCTTCT 


TCGTGGTGTT 


550 


CAACGTGATG 


AAATCGAACG 


TGGACAAGTT 


ATCGCTAAAC 


CAGGTTCAAT 


600 


CAACCCACAC 


ACTAAATTCA 


AAGGTGAAGT 


CTACATCCTT 


ACTAAAGAAG 


650 


AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGTCC 


ACAATTCTAC 


700 


TTCCGTACTA 


CTGACGTTAC 


AGGTTCAATC 


GAACTTCCAG 


CAGGTACTGA 


750 


AATGGTAATG 


CCTGGTGATA 


ACGTGACAAT 


CGACGTTGAG 


TTGATTCACC 


800 


CAATCGCCGT 


AGAACAA 








817 



20 



2) INFORMATION FOR SEQ ID NO: 1733 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1733 



CGGTGCGATC 
GTGAGCACAT 
TTCATGAACA 

40 TGAAATGGAA 
ACATTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 

45 CGTGGTGTTG 
AGACATCCAA 
TTGACGRAGG 
CAACGTGATG 
CAACCCACAC 

50 AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGACA 
AATCACTGGT 
TTCGTGTCAA 
AAAGCAGTTG 
TATTGCCGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
TGAACAA 



CATCTACTGA 
CGTCAGGTTG 
GGTTGACGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGACTTGATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACTGGTGT 
GATAACGTCG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACTAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGATTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGTTGCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
TTGGCTGCAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCTG 
CGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGTGATTCT 250 

ACGAATACAT 300 

CCAGTCGAAG 350 

ACGTATCGAC 400 

GTCTTAAAGA 450 

CGTAAACAAC 500 

TCGTGGTATC 550 

CTGGTTCAAT 600 

TCTAAAGAAG 650 

ACAGTTCTAC 700 

CAGGTACTGA 750 

TTGATCCACC 800 
817 



902 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1734 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Agrobacterium tumefaciens 
(C) ACCESSION NUMBER: x9?673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1734 



AACATGATCA 
GGCTGCCGAC 
GTCAGGTGGG 

20 GTTGACGACG 
TCTGTCGTCC 
CGGCACTTGC 
ATCCGCGAGC 
TCCGATCGAC 

25 CGGGTCGTGG 
GTTGGTGAAG 
TGTTACCGGC 
GCGACAACAT 
CGTGGTCAGA 

30 CATGGCAGAA 
CGTTCTTCAC 
ACCGGTATCG 
CAACGTCACT 
AGCTGCGCTT 

35 



CCGGTGCTGC 
GGCCCGATGC 
CGTTCCGGCC 
CCGAGCTTCT 
TACGACTTCC 
TGCTCTTGAA 
TGATGGCTGC 
CAGCCGTTCC 
TACGGTTGTG 
AAGTCGAAAT 
GTTGAAATGT 
CGGTGCACTC 
TCCTGTGCAA 
GCCTACATCC 
GAACTACCGT 
TTTCGCTTCC 
GTTGAAGTCG 
CGCTATCCGC 



CGAGATGGAC 
CACAGACCCG 
ATCGTCGTGT 
CGAGCTCGTC 
CGGGCGACGA 
GATTCTGACA 
TGTCGACGCC 
TGATGCCGAT 
ACGGGTCGCG 
CGTCGGCATC 
TCCGCAAGCT 
GTTCGCGGCG 
GCCGGGTTCG 
TGACGAAGGA 
CCGCAGTTCT 
TGAAGGCACG 
AGCTGATCGT 
GAAGGCGGCC 



GGCGCGATCC 
CGAGCACATC 
TCCTCAACAA 
GAGCTTGAAG 
TATCCCGATC 
AGAAGATCGG 
TACATCCCGA 
CGAAGACGTG 
TTGAGCGCGG 
CGTCCGACCT 
GCTCGACCAG 
TTACCCGTGA 
GTCAAGCCGC 
AGAAGGCGGC 
ACTTCCGTAC 
GAAATGGTTA 
TCCGATCGCG 
GTACCGTCGG 



TGGTTTGCTC 50 

CTGCTTGCCC 100 

GGTCGACCAG 150 

TTCGCGAACT 200 

ATCAAGGGTT 250 

TGAAGACGCG 300 

CGCCTGAGCG 350 

TTCTCGATCT 4 00 

TATCGTCAAG 4 50 

CGAAGACGAC 500 

GGCCAGGCCG 550 

CGGCGTCGAG 600 

ACAAGAAGTT 650 

CGTCATACGC 700 

GACTGACGTT 750 

TGCCTGGCGA 800 

ATGGAAGAAA 850 

CGCCGGC 8 97 



2) INFORMATION FOR SEQ ID NO: 1735 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus subtilis 
50 (B) STRAIN: 168 

(C) ACCESSION NUMBER: Z99104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1735 

903 
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ATGATCACTG 


GTGCTGCGCA 


AATGGACGGA 


GCTATCCTTG 


TAGTATCTGC 


50 


TGCTGATGGC 


CCAATGCCAC 


AAACTCGTGA 


GCACATCCTT 


CTTTCTAAAA 


100 


ACGTTGGTGT 


ACCATACATC 


GTTGTATTCT 


TAAACAAATG 


CGACATGGTA 


150 


GACGACGAAG 


AGCTTCTTGA 


ACTAGTTGAA 


ATGGAAGTTC 


GCGATCTTCT 


200 


TAGCGAATAC 


GACTTCCCTG 


GTGATGATGT 


ACCAGTTGTT 


AAAGGTTCTG 


250 


CTCTTAAAGC 


TCTTGAAGGA 


GACGCTGAGT 


GGGAAGCTAA 


AATCTTCGAA 


300 


CTTATGGATG 


CGGTTGATGA 


GTACATCCCA 


ACTCCAGAAC 


GCGACACTGA 


350 


AAAACCATTC 


ATGATGCCAG 


TTGAGGACGT 


ATTCTCAATC 


ACTGGTCGTG 


400 


GTACAGTTGC 


TACTGGCCGT 


GTAGAACGCG 


GACAAGTTAA 


AGTCGGTGAC 


450 


GAAGTTGAAA 


TCATCGGTCT 


TCAAGAAGAG 


AACAAGAAAA 


CAACTGTTAC 


500 


AGGTGTTGAA 


ATGTTCCGTA 


AGCTTCTTGA 


TTACGCTGAA 


GCTGGTGACA 


550 


ACATTGGTGC 


CCTTCTTCGC 


GGTGTATCTC 


GTGAAGAAAT 


CCAACGTGGT 


600 


CAAGTACTTG 


CTAAACCAGG 


TACAATCACT 


CCACACAGCA 


AATTCAAAGC 


650 


TGAAGTTTAC 


GTTCTTTCTA 


AAGAAGAGGG 


TGGACGTCAT 


ACTCCATTCT 


700 


TCTCTAACTA 


CCGTCCTCAG 


TTCTACTTCC 


GTACAACTGA 


CGTAACTGGT 


750 


ATCATCCATC 


TTCCAGAAGG 


CGTAGAAATG 


GTTATGCCTG 


GAGATAACAC 


800 


TGAAATGAAC 


GTTGAACTTA 


TTTCTACAAT 


CGCTATCGAA 


GAAGGAACTC 


850 


GTTTCTCTAT 


TCGTGAAGGC 


GGACGTACTG 


TTGGT 




885 



20. 



2) INFORMATION FOR SEQ ID NO: 1736 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 
35 (B) STRAIN: DSM 2151 

(C) ACCESSION NUMBER: P33165 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1736 



40 


ATGGTTACTG 


GTGCTGCTCA 


GATGGACGGT 


GCTATCATTG 


TAGTTGCTGC 


50 




TACTGATGGT 


CCGATGCCTC 


AGACTCGTGA 


GCACATCCTT 


TTGGCTCGTC 


100 




AGGTAAACGT 


TCCGAAGCTG 


GTTGTATTCA 


TGAACAAGTG 


CGATATGGTT 


150 




GAAGATGCTG 


AGATGTTGGA 


ACTTGTTGAA 


ATGGAAATGA 


GAGAATTGCT 


200 




TTCATTCTAT 


GATTTCGACG 


GTGACAATAC 


TCCGATCATT 


CAGGGTTCTG 


250 


45 


CTCTTGGTGC 


ATTGAACGGC 


GTAGAAAAAT 


GGGAAGACAA 


AGTAATGGAA 


300 




CTGATGGAAG 


CTGTTGATAC 


TTGGATTCCA 


CTGCCTCCGC 


GCGATGTTGA 


350 




TAAACCTTTC 


TTGATGCCGG 


TAGAAGACGT 


GTTCTCTATC 


ACAGGTCGTG 


400 




GTACTGTAGC 


TACAGGTCGT 


ATCGAAACTG 


GTGTTATCCA 


TGTAGGTGAT 


450 




GAAATCGAAA 


TCCTCGGTTT 


GGGTGAAGAT 


AAGAAATCAG 


TTGTAACAGG 


500 


50 


TGTTGAAATG 


TTCCGCAAAC 


TTCTGGATCA 


GGGTGAAGCT 


GGTGACAACG 


550 




TAGGTCTGTT 


GCTTCGTGGT 


GTTGACAAGA 


ACGAAATCAA 


ACGTGGTATG 


600 




GTTCTTTGTA 


AACCGGGTCA 


GATTAAACCT 


CACTCTAAAT 


TCAAAGCAGA 


650 




GGTTTATATC 


CTGAAGAAAG 


AAGAAGGTGG 


TCGTCACACT 


CCATTCCATA 


700 
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ACAAATATCG TCCTCAGTTC TACCTGCGTA 

ATCACTCTTC CGGAAGGAAC TGAAATGGTA 

TATCACTGTA GAGTTGATCT ATCCGGTTGC 

TCGCTATCCG CGAAGGTGGA CGTACAGTAG 



CTATGGACTG TACAGGTGAA 750 

ATGCCGGGTG ATAACGTAAC 800 

ACTGAACATC GGTCTTCGTT 850 

GT 882 



2) INFORMATION FOR SEQ ID NO: 1737 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 
20 (B) STRAIN: U78183 

(C) ACCESSION NUMBER: U78183 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1737 



25 


AATATGATTA 


CAGGAGCAGC 


TCAAATGGAT 


GCAGCGATAC 


TTTTAGTTGC 


50 




TGCTGATAGT 


GGTGCTGAGC 


CTCAAACAAA 


AGAGCATTTG 


CTTCTTGCTC 


100 




AAAGAATGGG 


AATAAAGAAA 


ATAATAGTTT 


TTTTAAATAA 


ATTGGACTTA 


150 




GCAGATCCTG 


AACTTGTTGA 


GCTTGTTGAA 


GTTGAAGTTT 


TAGAACTTGT 


200 




TGAAAAATAT 


GGCTTTTCAG 


CTGATACTCC 


AATAATCAAA 


GGTTCAGCTT 


250 


30 


TTGGGGCTAT 


GTCAAATCCA 


GAAGATCCTG 


AATCTACAAA 


ATGCGTTAAA 


300 




GAACTTCTTG 


AATCTATGGA 


TAATTATTTT 


GATCTTCCAG 


AAAGAGATAT 


350 




TGACAAGCCA 


TTTTTGCTTG 


CTGTTGAAGA 


TGTATTTTCT 


ATTTCAGGAA 


400 




GAGGCACTGT 


TGCTACTGGG 


CGTATTGAAA 


GAGGTATTAT 


TAAAGTTGGT 


450 




CAAGAAGTTG 


AAATAGTTGG 


AATTAAAGAA 


ACCAGAAAAA 


CTACTGTTAC 


500 


35 


TGGTGTTGAA 


ATGTTCCAGA 


AAATTCTTGA 


GCAAGGTCAA 


GCAGGGGATA 


550 




ATGTTGGTCT 


TCTTTTGAGA 


GGCGTTGATA 


AAAAAGACAT 


TGAGAGGGGG 


600 




CAAGTTTTGT 


CAGCTCCAGG 


TACAATTACT 


CCACACAAGA 


AATTTAAAGC 


650 




TTCAATTTAT 


TGTTTGACTA 


AAGAAGAAGG 


CGGTAGGCAC 


AAGCCATTTT 


700 




TCCCAGGGTA 


TAGACCACAG 


TTCTTTTTTA 


GAACAACCGA 


TGTTACTGGA 


750 


40 


GTTGTTGCTT 


TAGAGGGCAA 


AGAAATGGTT 


ATGCCTGGTG 


ATAATGTTGA 


. 800 




TATTATTGTT 


GAGCTGATCT 


CTTCAATAGC 


TATGGATAAG 


AATGTAGAAT 


850 




TTGCTGTTCG 


AGAAGGTGGA 


AGAACCGTTG 


CTTCAGGA 




888 



45 

2) INFORMATION FOR SEQ ID NO: 1738 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



905 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevibacterium linens 
5 (B) STRAIN: DSM 20425 

(C) ACCESSION NUMBER: X76863 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 38 



10 


AACATGATCA 


CCGGTGCCGC 


TCAGATGGAC 


GGTGCGATCC 


TCGTCGTCGC 


50 




CGCTACCGAC 


GGACCGATGC 


CCCAGACCCG 


TGAGCACGTG 


CTGCTCGCGC 


100 




GTCAGGTCGG 


CGTTCCCTAC 


ATCGTCGTGG 


CTCTGAACAA 


GTCCGACATG 


150 




GTCGATGACG 


AGGAGCTCCT 


CGAGCTCGTC 


GAATTCGAGG 


TCCGCGACCT 


200 








ACGGAGACAA 


CGCTCCGGTC 


ATTCCGGTGT 


250 


15 


CCGCTCTCAA 


GGCGCTGGAA 


GC-CGACGAGA 


AGTGGGTCAA 


GAGCGTTCAG 


300 




GATCTCATGG 


CTGCCGTCGA 


TGACAACGTT 


CCGGAGCCGG 


AGCGCGATGT 


350 




CGACAAGCCG 


TTCCTCATGC 


CCGTCGAGGA 


CGTCTTCACG 


ATCACCGGTC 


400 




GTGGAACCGT 


CGTCACCGGT 


CGTGTCGAGC 


GCGGCGTGCT 


CCTGCCTAAC 


450 




GACGAAATCG 


AAATCGTCGG 


CATCAAGGAG 


AAGTCGTCCA 


AGACGACTGT 


500 


20 


CACCGCTATC 


GAGATGTTCC 


GCAAGACCCT 


GCCGGATGCC 


CGTGCAGGTG 


550 




AGAACGTCGG 


TCTGCTCCTC 


CGCGGCACCA 


AGCGCGAGGA 


TGTTGAGCGC 


600 




GGTCAGGTCA 


TCGTGAAGCC 


GGGTTCGATC 


ACCCCGCACA 


CCAAGTTCGA 


650 




GGCTCAGGTC 


TACATCCTGA 


GCAAGGACGA 


GGGCGGACGT 


CACAACCCGT 


700 




TCTACTCGAA 


CTACCGTCCG 


CAGTTCTACT 


TCCGGACCAC 


GGACGTCACC 


750 


25 


GGTGTCATCA 


CGCTGCCCGA 


GGGCACCGAG 


ATGGTCATGC 


CCGGCGACAA 


800 




CACCGATATG 


TCGGTCGAGC 


TCATCCAGCC 


GATCGCTATG 


GAGGACCGCC 


850 




TCCGCTTCGC 


AATCCGCGAA 


GGTGGCCGCA 


CCGTCGGCGC 


CGGT 


894 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1739 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 888 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN : F/IC-Cal-13 

(C) ACCESSION NUMBER: L22216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1739 



ATGATCACGG GCGCAGCGCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

AGCAGACGGC CCGATGCCGC AAACGCGTGA GCACATCCTG CTGGCGCGTC 100 

50 AGGTTGGTGT TCCGTACATC ATCGTGTTCC TGAACAAGTG CGACAGTGTG 150 

GACGACGCTG AACTGCTCGA GCTGGTCGAG ATGGAAGTTC GCGAACTCCT 200 

GTCGAAGTAC GACTTCCCGG GCGACGACAC GCCGATCGTG AAGGGTTCGG 250 

CCAAGCTGGC GCTGGAAGGC GACACGGGCG AGCTGGGCGA AGTGGCGATC 300 

906 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ATGAGCCTGG 


CAGACGCGCT 


GGACACGTAC 


ATCCCGACGC 


CGGAGCGTGC 


350 


AGTTGACGGC 


GCGTTCCTGA 


TGCCGGTGGA 


AGACGTGTTC 


TCGATCTCGG 


400 


GCCGTGGTAC 


GGTGGTGACG 


GGTCGTGTCG 


AGCGCGGCAT 


CGTGAAGGTC 


450 


GGCGAAGAAA 


TCGAAATCGT 


CGGTATCAAG 


CCGACGGTGA 


AGACGACCTG 


500 


CACGGGCGTT 


GAAATGTTCC 


GCAAGCTGCT 


GGACCAAGGT 


CAGGCAGGCG 


550 


ACAACGTCGG 


TATCCTGCTG 


CGCGGCACGA 


AGCGTGAAGA 


CGTGGAGCGT 


600 


GGCCAGGTTC 


TGGCGAAGCC 


GGGTTCGATC 


ACGCCGCACA 


CGCACTTCAC 


650 


GGCTGAAGTG 


TACGTGCTGA 


GCAAGGACGA 


AGGCGGCCGT 


CACACGCCGT 


700 


TCTTCAACAA 


CTACCGTCCG 


CAGTTCTACT 


TCCGTACGAC 


GGACGTGACG 


750 


GGCTCGATCG 


AGCTGCCGAA 


GGACAAGGAA 


ATGGTGATGC 


CGGGCGACAA 


800 


CGTGTCGATC 


ACGGTGAAGC 


TGATTGCTCC 


GATCGCGATG 


GAAGAAGGTC 


850 


TGCGCTTCGC 


AATCCGTGAA 


GGCGGCCGTA 


CGGTCGGC 




888 



15 



2) INFORMATION FOR SEQ ID NO: 1740 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
20 (B) TYPE: Nucleic acid 

' (C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fibrobacter succinogenes 

(B) STRAIN: S85 

(C) ACCESSION NUMBER: X76866 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1740 



AACATGGTGA 
CGCTACTGAC 

35 ACCAGGTTGG 
GTTGACGATG 
CCTCTCCAAG 
CCGCTCTCAA 
GAACTCATGA 

40 CGACAAGCCG 
GCGGCACTGT 
GACAAGGTTG 
CGGTGTTGAA 
ACGTTGGTCT 

45 ATGGTTCTCG 
TGAAATCTAC 
TGAATGGCTA 
ACGATCCAGC 
CACGATCCAC 

50 GCTTCGCTAT 



CTGGTGCTGC 
GGTCCGATGC 
CGTGCCGAAG 
CTGAAATTCT 
TATGACTTCG 
GGCCCTCGAA 
ACGCTTGCGA 
TTCCTCATGC 
CGCTACTGGC 
AACGTATCGG 
ATGTTCCGTA 
CCTCCTCCGT 
CAGCTCCGAA 
GTTCTCACGA 
CCGTCCGCAG 
TCCCGGAAGG 
GTGAACCTCA 
CCGTGAAGGT 



TCAGATGGAC 
CGCAGACTCG 
ATCGTCGTGT 
CGACCTCGTC 
ACGGTGACAA 
GGCGATCCGG 
CGAATACATC 
CGATCGAAGA 
CGTATCGAAC 
TCTCGGTGAA 
AGCTCCTCGA 
GGTGCTGAAA 
GTCTGTCACT 
AGGACGAAGG 
TTCTACTTCC 
TGTCGAAATG 
TCGCTCCGAT 
GGACGTACTG 



GGCGCTATCC 
CGAACACATC 
TCATGAACAA 
GAAATGGAAG 
CACCCCGATC 
AATACCAGGA 
CCGCTCCCGC 
CGTGTTCACG 
GCGGTGTCGT 
ACCACCGAAT 
CGACGCTCAG 
AGAAGGACAT 
CCGCACACCG 
TGGCCGTCAC 
GCACCACCGA 
GTTACTCCGG 
CGCTATGGAA 
TTGGTGCTGG 



TCGTTGTTGC 50 

CTTCTCGCTC 100 

GTGCGACATG 150 

TTCGCGAACT 200 

ATCCGTGGTT 250 

CAAGGTCATG 300 

AGCGCGATAC 350 

ATTACTGGCC 400 

TCGCTTGAAC 450 

ACGTCATCAC 500 

GCAGGTGACA 550 

CGTCCGTGGC 600 

AATTTAAGGC 650 

ACGCCGTTCA 700 

CGTTACTGGT 750 

GTGACACGGT 800 

AAGCAGCTCC 850 

C 891 



907 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1741 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Flavobacterium ferrugineum 

(B) STRAIN: DSM 13524 

(C) ACCESSION NUMBER: X76867 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1741 





AACATGATCA 


CCGGTGCTGC 


CCAGATGGAC 


GGTGCTATCT 


TAGTTGTGGC 


50 




TGCATCAGAC 


GGTCCTATGC 


CTCAAACAAA 


AGAACACATC 


CTGCTTGCTG 


100 


20 


CCCAGGTAGG 


TGTACCTAAA 


ATGGTTGTGT 


TTCTGAATAA 


AGTTGACCTC 


150 




GTTGACGACG 


AAGAGCTCCT 


GGAGCTGGTT 


GAGATCGAGG 


TTCGCGAAGA 


200 




ACTGACTAAA 


CGCGGTTTCG 


ACGGCGACAA 


CACTCCAATC 


ATCAAAGGTT 


250 




CCGCTACAGG 


CGCCCTCGCT 


GGTGAAGAAA 


AGTGGGTTAA 


AGAAATTGAA 


300 




AACCTGATGG 


ACGCTGTTGA 


CAGCTACATC 


CCACTGCCTC 


CTCGTCCGGT 


350 


25 


TGATCTGCCG 


TTCCTGATGA 


GCGTAGAGGA 


CGTATTCTCT 


ATCACTGGTC 


400 




GTGGTACTGT 


TGCTACCGGT 


CGTATCGAGC 


GTGGCCGTAT 


CAAAGTTGGT 


450 




GAGCCTGTTG 


AGATCGTAGG 


TCTGCAGGAG 


TCTCCCCTGA 


ACTCTACCGT 


500 




TACAGGTGTT 


GAGATGTTCC 


GCAAACTCCT 


CGACGAAGGT 


GAAGCTGGTG 


550 




ATAACGCCGG 


TCTCCTCCTC 


CGTGGTGTTG 


AAAAAACACA 


GATCCGTCGC 


600 


30 


GGTATGGTAA 


TCGTTAAACC 


CGGTTCCATC 


ACTCCGCACA 


CGGACTTCAA 


650 




AGGCGAAGTT 


TACGTACTGA 


GCAAAGACGA 


AGGTGGCCGT 


CACACTCCAT 


700 




TCTTCAACAA 


ATACCGTCCT 


CAATTCTACT 


TCCGTACAAC 


TGACGTTACA 


750 




GGTGAAGTAG 


AACTGAACGC 


AGGAACAGAA 


ATGGTTATGC 


CTGGTGATAA 


800 




CACCAACCTG 


ACCGTTAAAC 


TGATCCAACC 


GATCGCTATG 


GAAAAAGGTC 


850 


35 


TGAAATTCGC 


GATCCGCGAA 


GGTGGCCGTA 


CCGTAGGTGC 


AGGA 


894 



2) INFORMATION FOR SEQ ID NO: 1742 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Helicobacter pylori 

(B) STRAIN: 26695 

(C) ACCESSION NUMBER: AE000626 



908 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1742 



AACATGATCA 
TGCAGCTGAT 
5 GTCAAGTAGG 
GTAGATGACC 
GTTGAGCGCG 
CAGCTTTAAG 
GGTGAAAAAG 

10 TCCAGAAAGA 
TCTCTATTGC 
GTGGTGAAAG 
AAAAACGACT 
GTGAAGCCGG 

15 GAAGTGGAAC 
CAAGAAATTT 
GACACACTCC 
ACTGATGTGA 
GCCTGGCGAT 

20 TAGAGTTGGG 
GCTGGT 



CCGGTGCGGC 
GGCCCTATGC 
CGTGCCTCAC 
AAGAATTGTT 
TATGAATTTC 
AGCTTTAGAA 
TGCTTAAACT 
GACACTGAAA 
GGGTAGAGGG 
TAGGCGATGA 
GTAACCGGTG 
CGATAATGTG 
GCGGTATGGT 
GAGGGAGAAA 
ATTCTTCACC 
CTGGCTCTAT 
AATGTGAAAA 
AACTAAATTT 



GCAAATGGAC 
CTCAAACTAG 
ATCGTTGTTT 
AGAACTTGTA 
CTGGCGATGA 
GAAGCAAAGG 
TATGGCTGAA 
AAACTTTCTT 
ACTGTGGTTA 
AGTGGAAATC 
TAGAAATGTT 
GGCGTGCTTT 
TCTATGCAAA 
TT'TATGTCCT 
AATTACCGCC 
CACCCTTCCT 
TCACTGTAGA 
GCGATTCGTG 



GGAGCGATTT 
GGAGCATATC 
TCTTAAACAA 
GAAATGGAAG 
CACTCCTATC 
CTGGTAATGT 
GTGGATGCCT 
GATGCCGGTT 
CAGGTAGGAT 
GTTGGTATCA 
TAGGAAAGAG 
TGAGAGGAAC 
CCAGGTTCTA 
TTCTAAAGAA 
CGCAATTCTA 
GAAGGCGTAG 
GTTGATTAGC 
AAGGCGGTAG 



PCT/CAOO/01150 



TGGTTGTTTC 50 

TTATTGTCTC 100 

ACAAGACATG 150 

TGCGCGAATT 200 

GTAGCGGGTT 250 

GGGTGAATGG 300 

ATATCCCTAC 350 

GAAGATGTGT 400 

TGAAAGAGGC 450 

GACCTACACA 500 

TTGGAAAAAG 550 

TAAAAAAGAA 600 

TCACTCCGCA 650 

GAAGGCGGGA 700 

TGTGCGCACA 750 

AAATGGTTAT 800 

CCTGTTGCGT 850 

GACCGTTGGT 900 
906 



25 2) INFORMATION FOR SEQ ID NO: 1743 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus luteus 

(B) STRAIN: IFO 3333 

(C) ACCESSION NUMBER: M17788 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1743 



AACATGATCA 
CGCTACCGAC 
GCCAGGTCGG 

4 5 GTGGAGGACG 
GCTGTCCTCC 
CCGCTCTGAA 
GACCTCATGG 
GGACAAGCCG 

50 GTGGCACCGT 
TCCGAGGTCG 
CGGCATCGAG 
ACTGCGGTCT 



CCGGCGCCGC 
GGCCCGATGG 
CGTGCCGGCC 
AGGAGCTCCT 
AGGAGCTTCG 
GGCCCTCGAG 
ATGCCGTGGA 
TTCCTGATGC 
GGTGACCGGT 
AGATCGTCGG 
ATGTTCCACA 
GCTCGTGCGC 



TCAGATGGAC 
CCCAGACCCG 
CTGCTCGTGG 
CGAGCGTGTC 
ACGTCGACGA 
GGCGACCCCC 
CGAGTACATC 
CGATCGAGGA 
CGCGCCGAGC 
CATCCGCGAC 
AGCAGCTCGA 
GGTCTGAAGC 



GGCGCGATCC 
TGAGCACGTG 
CCCTGAACAA 
GAGATGGAGG 
GGCCCCGGTC 
AGTGGGTCAA 
CCGGACCCGG 
CGTCTTCACG 
GCGGCACCCT 
GTGCAGAAGA 
CGAGGCCTGG 
GCGACGACGT 



TCGTGGTCGC 50 

CTCCTGGCCC 100 

GTCGGACATG 150 

TCCGGCAGCT 200 

ATCCGCACCT 250 

GTCCGTCGAG 300 

TGCGCGACAA 350 

ATCACCGGCC 400 

GAAGATCAAC 450 

CCACTGTCAC 500 

GCCGGCGAGA 550 

CGAGCGCGGC 600 



909 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CAGGTGCTGG TGGAGCCGGG CTCCATCACC 

GAACGTCTAC ATCCTGTCCA AGGACGAGGG 

ACTCGAACTA CCGCGCGCAG TTCTACTTCC 

GTCATCACGC TGCCCGAGGG CACCGAGATG 

5 CGAGATGTCG GTCGAGCTCA TCCAGCCGAT 

GCTTCGCCAT CCGCGAGGGT GGCCGCACCG 



PCT/CA00/01150 



CCGCACACCA 


ACTTCGAGGC 


650 


TGGGCGTCAC 


ACCCCGTTCT 


700 


GCACCACCGA 


CGTCACCGGC 


750 


GTCATGCCCG 


GCGACACCAC 


800 


CGCCATGGAG 


GAGGGCCTCG 


850 


TGGGCTCCGG 


C 


891 



10 2) INFORMATION FOR SEQ ID NO: 1744 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:. Linear 



(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Erdmann 

(C) ACCESSION NUMBER: X63539 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1744 



AACATGATCA 
CGCCACCGAC 
GTCAAGTGGG 

30 GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 

35 GAACCGTGGT 
GAAGTTGAGA 
CGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 

40 CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCGAT 

45 



CCGGCGCCGC 
GGCCCGATGC 
TGTGCCCTAC 
AGGAGCTGCT 
CAGGAATTCG 
GCTCGAGGGT 
CGGTCGACGA 
CTGATGCCGG 
CACCGGACGT 
TCGTCGGCAT 
ATGTTCCGCA 
GCTGCTGCGG 
CCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGTTGA 
CCGCGAGGGT 



GCAGATGGAC 
CCCAGACCCG 
ATCCTGGTAG 
CGAACTCGTC 
ACGAGGACGC 
GACGCGAAGT 
GTCGATTCCG 
TCGAGGACGT 
GTGGAGCGCG 
TCGCCCATCG 
AGCTGCTCGA 
GGCGTCAAGC 
CACCACCACG 
AGGACGAGGG 
TTCTACTTCC 
CACCGAGATG 
TCCAGCCCGT 
GGCCGCACCG 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 
TGGGCGCCGG 



TGGTGGTCGC 50 

CTGCTGGCGC 100 

GGCCGACGCA 150 

TCCGCGAGCT 200 

CGGGTCTCGG 250 

TGTCGAGGAA 300 

GCGAGACCGA 350 

ACCGGCCGCG 4 00 

CGTGAACGAG 4 50 

CCACCGTCAC 500 

GCGGGCGACA 550 

CGAGCGTGGC 600 

AGTTCGAAGG 650 

ACGCCGTTCT 700 

CGTGACCGGT 750 

GTGACAACAC 800 

GAAGGTCTGC 850 

C 891 



2) INFORMATION FOR SEQ ID NO: 1745 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

910 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma genitalium 

(B) STRAIN: G37 

(C) ACCESSION NUMBER: U39732 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1745 



AATATGATCA 
AGCAACTGAT 
GCCAAGTAGG 

15 GCTAGTGATG 
GTTAACTTCC 
CAGCTTTAAA 
GATTTGATTA 
AGATAAACCT 

20 GAGGTACAGT 
CAAGAAGTTG 
TGGAATTGAA 
ATGCTGGGGT 
CAAGTTTTAG 

25 TGAGATCTAT 
TAAACGGTTA 
TCTATTGCTT 
TTCTATTACT 
AGTTCTCAAT 

30 



CAGGTGCTGC 
AGTGTGATGC 
GGTTCCTAAA 
AAGAGGTACA 
TATGGTTTTG 
AGCATTGGAA 
AAGCAGTTGA 
TTCTTATTAG 
TGTTACAGGA 
AAATTGTTGG 
ATGTTCAAAA 
ATTATTACGT 
CAAAACCAGG 
GCTTTAAAGA 
CCGTCCTCAA 
TAGCTGAAAA 
GTTGAGTTAA 
TCGTGAAGGT 



ACAAATGGAT 
CCCAAACCCG 
ATGGTAGTTT 
AGAACTTGTT 
ATGGTAAGAA 
GGTGATCCAA 
TGAATGGATT 
CAATTGAAGA 
AGAGTTGAAA 
TTTAAAACCA 
AGGAACTTGA 
GGTGTTGAAC 
CTCTATTAAA 
AAGAAGAAGG 
TTCTATTTCC 
TACTGAAATG 
TTGCTCCTAT 
GGTAGAACTG 



GGAGCTATTC 
CGAGCACATC 
TTCTAAACAA 
GCTGAAGAAG 
CACTCCTATT 
AGTGGGAGGC 
CCAACTCCTA 
TACGATGACC 
GAGGTGAACT 
ATTAGAAAAG 
TTCAGCAATG 
GTAAAGAAGT 
CCGCACAAGA 
TGGTAGACAC 
GTACCACTGA 
GTTCTACCTG 
CGCTTGTGAA 
TAGGGGCAGG 



TAGTTGTTTC 50 

TTACTTGCCC 100 

GTGTGATATT 150 

TACGTGATCT 200 

ATTTATGGCT 250 

TAAGATCCAT 300 

CACGTGAAGT 350 

ATTACTGGTA 400 

CAAAGTAGGT 450 

CAGTTGTTAC 500 

GCTGGTGACA 550 

TGAAAGAGGT 600 

AATTTAAAGC 650 

ACTGGTTTTT 700 

TGTAACTGGT 750 

GTGATAATGC 800 

AAAGGTAGTA 850 

C • 891 



2) INFORMATION FOR SEQ ID NO: 1746 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: MS11 

(C) ACCESSION NUMBER: L36380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1746 



AACATGATTA CCGGCGCCGC 

TGCTGCCGAC GGCCCTATGC 

GTCAAGTAGG CGTACCTTAC 

GTCGACGATG CCGAGCTGTT 



ACAAATGGAC GGTGCAATCC 

CGCAAACCCG CGAACACATC 

ATCATCGTGT TCATGAACAA 

CCAACTGGTT GAAATGGAAA 



TGGTATGTTC 50 

CTGCTGGCCC 100 

ATGCGACATG 150 

TCCGCGACCT 200 



911 
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GCTGTCCAGC 


TACGACTTCC 


CCGGCGACGA 


CTGCCCGATC 


GTACAAGGTT 


250 


CCGCACTGAA 


AGCCTTGGAA 


GGCGATGCCG 


CTTACGAAGA 


AAAAATCTTC 


300 


GAACTGGCTA 


CCGCATTGGA 


CAGATACATC 


CCGACTCCCG 


AGCGTGCCGT 


350 


GGACAAACCA 


TTCCTGCTGC 


CTATCGAAGA 


CGTGTTCTCC 


ATTTCCGGCC 


400 


GCGGTACCGT 


AGTCACCGGC 


CGTGTAGAGC 


GAGGTATCAT 


CCACGTTGGT 


450 


GACGAGATTG 


AAATCGTCGG 


TCTGAAAGAA 


ACCCAAAAAA 


CCACCTGTAC 


500 


CGGCGTTGAA 


ATGTTCCGCA 


AACTGCTGGA 


CGAAGGTCAG 


GCGGGCGACA 


550 


ACGTAGGCGT 


ATTGCTGCGC 


GGTACCAAAC 


GTGAAGACGT 


AGAACGCGGT 


600 


CAGGTATTGG 


CCAAACGGGG 


TACTATCACT 


CCTCACACCA 


AGTTCAAAGC 


650 


AGAAGTGTAC 


GTATTGAGCA 


AAGAAGAGGG 


CGGCCCCCAT 


ACCCCGTTTT 


700 


TCGCCAACTA 


CCGTCCCCAA 


TTCTACTTCC 


GTACCACTGA 


CGTAACCGGC 


750 


ACGATTACTT 


TGGAAAAAGG 


TGTGGAAATG 


GTAATGCCGG 


GTGAGAACGT 


800 


AACCATTACT 


GTAGAACTGA 


TTGCGCCTAT 


CGCTATGGAA 


GAAGGTCTGC 


850 


GCTTTGCGAT 


TCGCGAAGGC 


GGCCGTACCG 


TGGGTGCCGG 


C 


891 



15 



2) INFORMATION FOR SEQ ID NO: 1747 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rickettsia prowazekii 
30 (B) STRAIN: Madrid E 

(C) ACCESSION NUMBER: Z54170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1747 



35 


AATATGATAA 


CTGGTGCCGC 


TCAGATGGAT 


GGTGCTATAT 


TAGTAGTTTC 


50 




TGCTGCTGAT 


GGTCCTATGC 


CTCAAACTAG 


AGAACATATA 


TTACTGGCAA 


100 




AACAGGTAGG 


TGTACCTGCT 


ATGGTAGTAT 


TTTTGAATAA 


AGTAGATATG 


150 




GTAGATGATC 


CTGACCTATT 


AGAATTAGTT 


GAGATGGAAG 


TAAGAGAATT 


200 




ATTATCAAAA 


TATGGTTTCC 


CTGGTAATGA 


AATACCTATT 


ATTAAAGGTT 


250 


40 


CTGCACTTCA 


AGCTTTAGAA 


GGAAAACCTG 


AAGGTGAAAA 


AGCTATTAAT 


300 




GAGTTAATGA 


ATGCAGTAGA 


TACGTATATA 


CCTCAGCCTA 


TAGAGCTACA 


350 




AGATAAACCT 


TTTTTAATGC 


CAATAGAGGA 


TGTATTTTCT 


ATTTCAGGCA 


400 




GAGGTACCGT 


TGTAACTGGT 


AGAGTGGAGT 


CAGGCATAAT 


TAAGGTGGGT 


450 




GAAGAAATTG 


AAATAGTAGG 


TCTAAAAAAT 


ACGCAAAAAA 


CGACTTGTAC 


500 


45 


AGGTGTAGAA 


ATGTTCAGAA 


AATTACTTGA 


TGAAGGACAA 


TCTGGAGATA 


550 




ATGTCGGTAT 


ATTACTACGT 


GGTACAAAAA 


GAGAAGAAGT 


AGAAAGAGGA 


600 




CAAGTACTTG 


CAAAACCTGG 


GAGCATAAAA 


CCGCATGATA 


AATTTGAAGC 


650 




TGAAGTGTAT 


GTGCTTAGTA 


AAGAGGAAGG 


TGGACGTCAT 


ACCCCATTTA 


700 




CTAATGATTA 


TCGCCCACAG 


TTCTATTTTA 


GAACAACAGA 


TGTTACCGGC 


750 


50 


ACAATAAAAT 


TGCCTTCTGA 


TAAGCAGATG 


GTTATGCCTG 


GAGATAATGC 


800 




TACTTTTTCA 


GTAGAATTAA 


TTAAGCCGAT 


TGCTATGCAA 


GAAGGGTTAA 


850 




AATTCTCTAT 


ACGTGAAGGT 


GGTAGAACAG 


TAGGAGCCGG 


T 


891 



912 
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PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1748 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: LT2 trpE91 

15 (C) ACCESSION NUMBER: X55116 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1748 





AACATGATCA 


CCGGTGCTGC 


TCAGATGGAC 


GGCGCGATCC 


TGGTTGTTGC 


50 


20 


TGCGACTGAC 


GGCCCGATGC 


CGCAGACCCG 


TGAGCACATC 


CTGCTGGGTC 


100 




GTCAGGTAGG 


CGTTCCGTAC 


ATCATCGTGT 


TCCTGAACAA 


ATGCGACATG 


150 




GTTGATGACG 


AAGAGCTGCT 


GGAACTGGTT 


GAGATGGAAG 


TTCGCGAACT 


200 




GCTGTCTCAG 


TACGACTTCC 


CGGGCGACGA 


CACTCCGATC 


GTTCGTGGTT 


250 




CTGCTCTGAA 


AGCGCTGGAA 


GGCGACGCAG 


AGTGGGAAGC 


GAAAATCATC 


300 


25 


GAACTGGCTG 


GCTTCCTGGA 


TTCTTATATT 


CCGGAACCAG 


AGCGTGCGAT 


350 




TGACAAGCCG 


TTCCTGCTGC 


CGATCGAAGA 


CGTATTCTCC 


ATCTCCGGTC 


400 




GTGGTACCGT 


TGTTACCGGT 


CGTGTAGAGC 


GCGGTATCAT 


CAAAGTGGGC 


450 




GAAGAAGTTG 


AAATCGTTGG 


TATCAAAGAG 


ACTCAGAAGT 


CTACCTGTAC 


500 




TGGCGTTGAA 


ATGTTCCGCA 


AACTGCTGGA 


CGAAGGCCGT 


GCCGGTGAGA 


550 


30 


ACGTAGGTGT 


TCTGCTGCGT 


GGTATCAAAC 


GTGAAGAAAT 


CGAACGTGGT 


600 




CAGGTACTGG 


CTAAGCCGGG 


CACCATCAAG 


CCGCACACCA 


AGTTCGAATC 


650 




TGAAGTGTAC 


ATTCTGTCCA 


AAGATGAAGG 


CGGCCGTCAT 


ACTCCGTTCT 


700 




TCAAAGGCTA 


CCGTCCGCAG 


TTCTACTTCC 


GTACTACTGA 


CGTGACTGGT 


750 




ACCATCGAAC 


TGCCGGAAGG 


CGTAGAGATG 


GTAATGCCGG 


GCGACAACAT 


800 


35 


CAAAATGGTT 


GTTACCCTGA 


TCCACCCGAT 


CGCGATGGAC 


GACGGTCTGC 


850 




GTTTCGCAAT 


CCGTGAAGGC 


GGCCGTACCG 


TTGGCGCGGG 


C 


891 



40 2) INFORMATION FOR SEQ ID NO: 1749 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: DSM 50426 



913 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1749 





ATGATCACTG 


GTGCTGCACA 


GATGGACGGC 


GCGATTCTGG 


TAGTCGCTTC 


50 




AACAGACGGT 


CCAATGCCAC 


AGACTCGTGA 


GCACATCCTG 


CTTTCTCGTC 


100 


5 


AGGTTGGCGT 


ACCATTCATC 


ATCGTATTCA 


TGAACAAATG 


TGACATGGTA 


150 




GATGACGAAG 


AGCTGTTAGA 


GCTAGTTGAG 


ATGGAAGTGC 


GTGAACTGTT 


200 




ATCAGAATAC 


GATTTCCCAG 


GTGATGACTT 


ACCGGTAATC 


CAAGGTTCAG 


250 




CTCTGAAAGC 


GCTAGAAGGC 


GAGCCAGAGT 


GGGAAGCAAA 


AATCCTTGAA 


300 




TTAGCAGCGG 


CGCTGGATTC 


TTACATTCCA 


GAACCACAAC 


GTGACATCGA 


350 


10 


TAAGCCGTTC 


CTACTGCCAA 


TCGAAGACGT 


ATTCTCAATT 


TCAGGCCGTG 


400 




GTACAGTAGT 


AACAGGTCGT 


GTTGAGCGTG 


GTATTGTACG 


CGTAGGCGAC 


450 




GAAGTTGAAA 


TCGTTGGTGT 


ACGTGCGACA 


ACTAAGACAA 


CGTGTACTGG 


500 




TGTAGAAATG 


TTCCGTAAAC 


TGCTTGACGA 


AGGTCGTGCA 


GGTGAGAACT 


550 




GTGGTATTTT 


GTTACGTGGT 


ACTAAGCGTG 


ATGACGTAGA 


ACGTGGTCAA 


600 


15 


GTATTAGCGA 


AGCCAGGTTC 


AATCAACCCA 


CACACTACTT 


TTGAATCAGA 


650 




AGTTTACGTA 


CTGTCAAAAG 


AAGAAGGTGG 


TCGTCACACG 


CCATTCTTCA 


700 




AAGGCTACCG 


TCCACAGTTC 


TACTTCCGTA 


CAACTGACGT 


AACCGGTACT 


750 




ATCGAACTGC 


CAGAAGGCGT 


AGAGATGGTA 


ATGCCAGGCG 


ATAACATCAA 


800 




GATGGTAGTG 


ACACTGATTT 


GCCCAATCGC 


GATGGACGAA 


GGTTTACGCT 


850 


20 


TCGCAATCCG 


TGAAGGCGGT 


CGTACAGTGG 


T 




881 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Stigmatella aurantiaca 

(B) STRAIN: DW4 

(C) ACCESSION NUMBER: X82820 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1750 



40 



45 



50 



AACATGATCA CGGGCGCGGC 

CGCGGCCGAC GGCCCGATGC 

GGCAGGTGGG CGTGCCCTAC 

CTGGACGATC CGGAGCTGCG 

GCTCAAGAAG TACGAGTTCC 

GCGCGCTCAA GGCGCTGGAG 

ATCCTGAAGC TGATGGCGGC 

TGCGACGGAC AAGCCGTTCC 

CAGGCCGAGG AACGGTGGCG 

GTGGGCGAGG AAGTGGAGAT 

CATCACGGGG GTGGAGATGT 

GAGACAACAT CGGAGCGCTG 

CGTGGGCAGG TGCTGGCGAA 



GCAGATGGAC 
CCCAGACGCG 
ATCGTCGTCT 
CGAGCTGGTG 
CGGGCGACAG 
GGAGACACCA 
GGTGGACGAG 
TGATGCCGGT 
ACGGGCCGAG 
CGTGGGGATC 
TCCGCAAGCT 
CTGCGAGGCC 
CTGGGGGAGC 



GGAGCGATTC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAGG 
CATCCCCATC 
GCGACATCGG 
TACATCCCGA 
GGAAGACGTG 
TGGAGCGCGG 
CGTCCGACGC 
GCTGGACGAG 
TGAAGCGCGA 
ATCAACCCGC 



TGGTGGTGTC 50 

CTGCTGGCCA 100 

GGTGGACATG 150 

TGCGCGACCT 200 

ATCCCTGGCA 250 

CGAGGGAGCG 300 

CGCCGCAGCG 350 

TTCTCCATCG 400 

CAAGATCAAG 4 50 

AGAAGACGGT 500 

GGCATGGCGG 550 

GGACCTGGAG 600 

ACACGAAGTT 650 
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CAAGGCGCAG GTGTACGTGC 

CGTTCTTCAA GGGATACCGG 

ACCGGAACGG TGAAGCTGCC 

CAACATCGCC ATCGAGGTGG 

5 AGCTGCCGTT CGCCATCCGT 



TGTCGAAGGA AGAGGGAGGG 

CCGCAGTTCT ACTTCCGGAC 

GGACAACGTG GAGATGGTGA 

AGCTCATTAC TCCGGTCGCC 

GAGGGTGGCC GCACGGTGGG 



CGGCACACGC 700 

GACGGACGTG 750 

TGCCGGGAGA 800 

ATGGAGAAGG 850 

CGCCGGC 897 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double' 

(D) TOPOLOGY: Linear 



20 



25 



30 



35 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thiomonas cuprina 

(B) STRAIN: Hoe5 

(C) ACCESSION NUMBER: x76871 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1751 



AACATGATCA 
CGCCGCCGAC 
GTCAGGTGGG 
GTCGACGACG 
GCTGTCCAAG 
CGGCCAAGCT 
ATTCTCAAGC 
GGCCGTCGAC 
CCGGGCGCGG 
GTCGGCGAGG 
CTGCACCGGC 
GCGACAACGT 
CGCGGCCAGG 
CACCGCCGAG 
CCTTCTTCAA 
ACCGGCGCCA 
TAATGTGAGC 
GCCTGCGCTT 



CCGGTGCGGC 
GGCCCCATGC 
CGTGCCCTAC 
CCGAGCTGCT 
TACGACTTCC 
GGCCCTCGAA 
TGGCCGAGGC 
GGCGCGTTCC 
CACGGTGGTC 
AAATCGAGAT 
GTGGAAATGT 
CGGCATCTTG 
TGCTGTGCAA 
GTGTACGTGC 
CAACTACCGC 
TCGAACTGCC 
ATCACCGTCA 
CGCCATCCGC 



CCAGATGGAC 
CCCAAACCCG 
ATCATCGTGT 
CGAACTCGTC 
CCGGTGACGA 
GGCGACAAGG 
CCTGGACACC 
TCATGCCCGT 
ACCGGGCGTG 
TGTCGGCCTC 
TCAGGAAGCT 
CTGCGCGGCA 
ACCCGGCTCG 
TGAGCAAGGA 
CCGCAGTTCT 
CAAGGACAAG 
AGCTCATCGC 
GAAGGCGGCC 



GGCGCCATCC 
CGAGCACATC 
TCCTCAACAA 
GAGATGGAAG 
CACCCCCATC 
GCGAACTGGG 
TACATCCCCA 
GGAAGACGTG 
TGGAGCGCGG 
AAGCCCACCC 
GCTCGACCAG 
CCAAGCGCGA 
ATCAAGCCCC 
CGAGGGCGGC 
ACTTCCGCAC 
GAAATGGTCA 
CCCCATCGCC 
GCACCGTCGG 



TGGTCGTGTC 50 

CTGCTGGCGC 100 

GTGCGACATG 150 

TGCGCGAGCT 200 

ATCAAGGGCT 250 

CGAAGGCGCC 300 

CGCCCGAGCG 350 

TTCTCCATCT 400 

CATCATCAAG 450 

TCAAGACCAC 500 

GGCCAGGCCG 550 

GGAAGTCGAG 600 

ACACCCACTT 650 

CGCCACACCC 700 

CACCGACGTC 750 

TGCCCGGCGA 800 

ATGGAAGAAG 850 

CGCCGGC 897 



15 



2) INFORMATION FOR SEQ ID NO: 1752 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



915 
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10 



15 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: AE001202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1752 



AATATGATCA CGGGTGCTGC 

TGCGCCTGAC GGCGTTATGC 

GTCAGGTTGG TGTTCCCTCC 

GTTGATGATC CTGAGTTGCT 

GCTTGCTGGA TATGGGTTTT 

CGTTTAAAGC TCTGCAGGAT 

GAGGAACTGC TTGCGGCCAT 

CGACGCAAGA CCTTTCTTGC 

GGCGTGGTAC CGTTGTCACG 

AATGAAGAGG TCGAGATCGT 

TACTGGCATT GAGATGTTTA 

ATAACGTGGG GCTGCTTTTG 

GGTCAGGTGC TTTCTAAGCC 

GGCGCAGATC TACGTGCTCT 

TTTTTCAAGG TTATCGTCCG 

GGTACGATTT CTCTTCCTGA 

CACCAAGATT ATAGGTGAGC 

TGAAGCTTGC GATTCGTGAA 



GCAGATGGAC 
CACAGACGAA 
ATCATTGTTT 
AGAGCTGGTG 
CGCGTGAGAC 
GGCGCTTCCC 
GGATTCCTAC 
TCTCTATCGA 
GGGCGCATCG 
CGGGATTAAG 
ATAAGTTGCT 
CGCGGGGTGG 
CGGTTCTATT 
CTAAGGAAGA 
CAGTTTTATT 
AGGGGTAGAC 
TCATCCACCC 
GGGGGGCGCA 



GGTGGTATTC 
GGAGCATCTT 
TTTTGAACAA 
GAAGAAGAGG 
GCCTATCGTC 
CGGAGGATGC 
TTTGAAGACC 
GGATGTGTAC 
AATGTGGGGT 
CCCACTAAGA 
TGATCAGGGA 
ATAAAAAAGA 
AAGCCACACA 
GGGTGGCCGT 
TTAGAACTAC 
ATGGTGAAGC 
GATAGCTATG 
CTATTGCTTC 



TCGTCGTGTC 50 

CTGCTCGCCC 100 

GGTTGATTTG 150 

TGCGTGATGC 200 

AAGGGGTCTG 250 

AGCTTGTATT 300 

CAGTGCGTGA 350 

ACTATTTCTG 400 

AATTAGTCTG 450 

AAACAGTGGT 500 

ATTGCAGGTG 550 

GGTTGAGCGC 600 

CCAAGTTTGA 650 

CACAGTCCTT 700 

TGACATTACC 750 

CGGGGGATAA 800 

GACAAGGGTC 850 

TGGT 894 



30 



2) INFORMATION FOR SEQ ID NO: 1753 



(i) ■ SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 



(vi) • ORIGINAL SOURCE: 

(A) ORGANISM: Ureaplasma vrealyticum 

(B) STRAIN: ATCC 33697 

45 (C) ACCESSION NUMBER: Z34275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1753 

AATATGATTA CAGGGGCAGC ACAAATGGAT GGAGCAATTT TAGTTATTGC 

50 TGCATCTGAT GGGGTTATGG CTCAAACTAA AGAACATATT TTATTAGCAC 

GTCAAGTTGG TGTTCCAAAA ATCGTTGTTT TCTTAAACAA ATGTGATTTC 

ATGACAGATC CAGATATGCA AGATCTTGTT GAAATGGAAG TTCGTGAATT 

ATTATCTAAA TATGGATTTG ATGGCGATAA CACACCAGTT ATTCGTGGTT 



50 
100 
150 
200 
250 
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LAGG1 L 1 1 AA 


LGL1 1 1 AbAA 


boHbn 1 LLHL 


tttp apaapp 


AAAA ATTP AT 


inn 

J VJ VJ 




GAA1 1 AA1GG 


7\ PPP APTTT TV 

ALbbAb 1 1 GA 


r P r rp7\ r rr , 7A 7att 

1 1 LA 1 1 1 


pp att apc ac 


A A PPT APT AP 


J J VJ 




T>/^* 7\ O TV A Tv /""" TV. 

1 GAlAAAClA 


T"7 , r ,r PT TV TV (~* 

1 1L 1 1 Al 1 AG 


LAA1 IbAAbA 


TPT ATTP APA 


ATTTP APP AP 
Mill LrtLL-rlv^ 


400 

*i VJ VJ 




G1GGTACAGT 


Abl AAL IbGA 


Lb 1 vj 1 1 L/iftL 


PTPPTPT ATT 
\j 1 LL 1 \j i n i i 


AA A APTTA AT 


450 
i j \j 


r— 

5 


GATGAGGTTG 


TV TV 7\ T T 1 T 1 T 1 /"* 

AAATTGI IGG 


T , P r P7AZV.7A7AP7AP 

1 L 1 AAAAb AL 


APTP A A A A A A 
riL 1 Lrt/irtttrttt 


PTPTTPTT AP 
Llbl 1 LI IriL 


500 




7\ /T^ TV TV T>rri/-'7\ TV 

AGGAATTGAA 


A I G i 1 1 AGAA 


7V.7ATP7ATT7AP7A 


TP A APPTPA A 


PPTPPTPATA 
LL 1 oo 1 Ori 1 


550 




tv m/"* /"*< m /-» m TV m 

ATGCTGGTAT 


ro rr* ^i Tv tv 

TTTATTACGT 


GGi Al 1 AAAA 


7A AP A TAP A TPT 
/iRLrlHLri 1 u 1 


TP A APPTPPT 
1 L/xrtLL 1 LL 1 


600 
ovj vj 




caagtacttg 


m tv t\ 7\ tv /"* z™ 1 7\ f* r~* 

TAAAACCAGG 


Al LAAL 1 AAA 


PPTP 7APPPT7A 


LI 1 1 1 ML 1 LL 


f%50 

O J VJ 




m 7V 7v Tt /** mmm tv m 

TAAAGTTTAT 


7\ rn rp PTT 7\ TV TV Tv 

ATTCTTAAAA 


AAGAAGAAGG 


TPP 7A PPTP 7AT 


AP A PPT ATTP 
nLnLLlnl 1 L 




10 


TTTCAGGATA 


LLb I LLALAA 


1 1 L 1 Al 1 1 1 A 


PA APA AP APA 
bnnLrinLnbn 


TPTA AP APPT 


7S0 




GCTATTTCAT 


TACCTGCTGG 


TGTTGATTTG 


GTTATGCCAG 


GTGATGACGT 


800 




TGAAATGACT 


GTAGAATTAA 


TTGCTCCAGT 


TGCGATTGAA 


GATGGATCTA 


850 




AATTCTCAAT 


CCGTGAAGGT 


GGTAAAACTG 


TAGGTCATGG 


T 


891 


15 


2) INFORMATION FOR SEQ ID NO: 1754 










(i) SEQUENCE CHARACTERISTICS: 








20 


(A) 


LENGTH: 909 


bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS : Double 










(D) 


TOPOLOGY: Linear 








25 


(ii) MOLECULE TYPE: Genomic DNA 










<vi) ORIGINAL SOURCE: 












(A) 


ORGANISM: Wolinella succinogenes 








(B) 


STRAIN: DSM 


1740 








30. 


(C) 


ACCESSION NUMBER: X76862 








(xi) SEQUENCE DESCRIPTION: SEQ ID 


NO: 1754 








AACATGATTA 


CAGGTGCTGC 


TCAAATGGAT 


GGCGCGATTC 


TTPTTPTTTP 


so 


35 


TGCGGCGGAT 


GGCCCCATGC 


CCCAAACTAG 


GGAGCACATT 


PTT'PTTTPTP 
L11L111L1L 


1 00 

J. VJ u 




GACAAGTAGG 


CGTTCCTTAC 


ATCGTGGTTT 


TCTTGAACAA 


APA APATATP 


1 50 

J. «J u 




GTTGATGACG 


CTGAGCTTCT 


TGAGCTTGTT 


GAAATGGAAG 


TT A PAPA APT 
1 1 MLML-rVJ-iL 1 


P00 
^ u u 




TCTTAGCAAC 


TACGACTTCC 


CTGGAGATGA 


CACTCCTATC 


PTTPP APPTT 
LI iuLnool 1 


950 




CCGCTCTTAA 


AGCTCTTGAA 


GAGGCTAACG 


ACCAGGAAAA 


tpttppppap 

loll uvjv/ono 


100 

J VJ VJ 


40 


TGGGGCGAGA 


AAGTATTGAA 


GCTTATGGCT 


GAGGTTGACC 


PAT ATATTPP 


150 

J *J VJ 




TACGCCTGAG 


CGAGATGTGG 


ATAAGCCTTT 


CCTTATGCCT 


PTTPA APAPP 
Oil LrtrtOrtOVj 


4 00 

*a VJ VJ 




TATTCTCCAT 


CGCGGGTCGT 


GGAACCGTTG 


TGACAGGAAG 


A ATTP A A APA 


4 50 

*3 *J VJ 




GGCGTGGTTA 


AAGTCGGTGA 


CGAAGTAGAA 


ATCGTTGGTA 


TPPPA AAPAP 
1 LLLrtrvttLrtL 


500 

«J VJ VJ 




ACAAAAAACA 


ACCGTAACTG 


GCGTTGAGAT 


GTTCCGAAAA 


P APPTPPAPA 
bnoL 1 LLrVLrv 


550 

J *J VJ 


45 


AGGGTGAGGC 


GGGTGACAAC 


GTTGGTGTTC 


TTTTGAGAGG 


P APP A APA A A 
LnL LHjHLMJtJA 


fiOO 

D VJ VJ 




GAAGATGTTG 


AGAGAGGTAT 


GGTTCTTTGT 


AAAATAGGTT 


PT ATP APTPP 
Llnl LftV— 1 LL 


650 

VV*J VJ 




TCACACTAAC 


TTTGAAGGTG 


AAGTTTACGT 


TCTTTCCAAA 


PAPPA APPPP 


700 

/ VJ VJ 




G AC G AC AC AC 


TCCATTCTTC 


AATGGATACC 


GACCTCAGTT 


PTATPTTAPA 
Llnlul 1 non 


750 

I *J VJ 




ACTACAGACG 


TTACCGGTTC 


TATCTCTCTT 


CCTGAGGGCG 


1 rtvrrtV3.fi A VJVJ X 


800 

U v w 


50 


TATGCCTGGT 


GACAACGTTA 


AGATCAATGT 


TGAGCTTATC . 


PPTPPTPTAP 
LL 1 LL 1 LI rtL 


R 50 

O *J VJ 




CCCTCGAAGA 


GGGAACACGA 


TTCGCGATCC 


GTGAAGGTGG 


TCGAACCGTT 


900 




GGTGCGGGT 










909 
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2) INFORMATION FOR SEQ ID NO: 1755 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia cepacia 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1755 



PCT/CAOO/01150 



ATGATCACGG 
AGCAGACGGC 
AGGTTGGTGT 
GACGACGCTG 
GTCGAAGTAC 
CCAAGCTGGC 
ATGAGCCTGG 
AGTTGACGGC 
GCCGTGGTAC 
GGCGAAGAAA 
CACGGGCGTT 
ACAACGTCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCAACAA 
GGCTCGATCG 
CGTGTCGATC 
TGCGCTTCGC 



GCGCAGCGCA 
CCGATGCCGC 
TCCGTACATC 
AACTGCTCGA 
GACTTCCCGG 
GCTGGAAGGC 
CAGACGCGCT 
GCGTTCCTGA 
GGTGGTGACG 
TCGAAATCGT 
GAAATGTTCC 
TATCCTGCTG 
TGGCGAAGCC 
TACGTGCTGA 
CTACCGTCCG 
AGCTGCCGAA 
ACGGTGAAGC 
AATCCGTGAA 



GATGGACGGC 
AAACGCGTGA 
ATCGTGTTCC 
GCTGGTCGAG 
GCGACGACAC 
GACACGGGCG 
GGACACGTAC 
TGCCGGTGGA 
GGTCGTGTCG 
CGGTATCAAG 
GCAAGCTGCT 
CGCGGCACGA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGACAAGGAA 
TGATTGCTCC 
GGCGGCCGTA 



GCGATCCTGG 
GCACATCCTG 
TGAACAAGTG 
ATGGAAGTTC 
GCCGATCGTG 
AGCTGGGCGA 
ATCCCGACGC 
AGACGTGTTC 
AGCGCGGCAT 
CCGACGGTGA 
GGACCAAGGT 
AGCGTGAAGA 
ACGCCGCACA 
AGGCGGCCGT 
TCCGTACGAC 
ATGGTGATGC 
GATCGCGATG 
CGGTCGGC 



TTTGCTCGGC 50 

CTGGCGCGTC 100 

CGACAGTGTG 150 

GCGAACTCCT 200 

AAGGGTTCGG 250 

AGTGGCGATC 300 

CGGAGCGTGC 350 

TCGATCTCGG 400 

CGTGAAGGTC 450 

AGACGACCTG 500 

CAGGCAGGCG 550 

CGTGGAGCGT 600 

CGCACTTCAC 650 

CACACGCCGT 700 

GGACGTGACG 750 

CGGGCGACAA 800 

GAAGAAGGTC 850 
888 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1756 
TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 
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AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 
CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 
ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 
GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 
CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 
AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 
AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 



100 
150 
200 
250 
300 
350 
388 



2) INFORMATION FOR SEQ ID NO: 1757 

(i) SEQUENCE CHARACTERISTICS:, 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1757 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 50 

AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 100 

CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 150 

ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 350 

AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1758 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1758 
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TGAAAGTTCA GGTAAAACAA CAGTTTCATT 
AGCGTCAAGG TGGACAAGCA GCGTTCATTG 
CCTGTATATG CACAAAAACT AGGTGTTAAC 
5 ACAACCTGAT ACAGGGGAGC AAGGTTTAGA 
GAAGTGGTGC GGTTGATATT ATCGTAATTG 
CCGAAAGCTG AAATTGAAGG AGACATGGGT 
AGCTCGTCTA ATGTCTCAAG CACTTCGTAA 
AATCAAAAAC AATCGCAATC TTTATTAACC 



ACACGCAATT GCAGAAGTAC 50 

ATGCTGAGCA TGCAATGGAT 100 

ATCGATGAAT TACTATTATC 150 

AATCGCAGAA GCACTTGTAC 200 

ACTCTGTAGC AGCTCTTGTA 250 

GACTCACACG TAGGTTTACA 300 

ACTTTCAGGT GCAATCAATA 350 

AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1759 

15 (i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
25 (B) STRAIN: ATCC 13472 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1759 

30 TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

35 CCGAAAGCAG AGATTGAAGG TGACATGGGT GACTCACACG TAGGTTTACA 300 

r AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1760 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 374 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 
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(xi) SEQUENCE DESCRIPTION: SEQ 

AAACGACAGT TTCATTACAT GCAATTGCAG 

5 CAAGCAGCAT TCATCGATGC GGAGCACGCA 

AAAATTAGGC GTTAACATAG ATGAATTACT 

GGGAGCAAGG ATTAGAAATC GCAGAAGCAC 

GACATTATCG TAATTGACTC TGTAGCAGCT 

TGAAGGAGAC ATGGGTGACT CACACGTAGG 

10 CACAAGCACT TCGTAAGCTT TCAGGAGCAA 

GCAATCTTTA TTAACCAAAT TCGT 
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ID NO: 1760 

AAGTACAACG TCAAGGTGGA 50 

ATGGATCCTG TATATGCACA 100 

ATTATCACAG CCTGATACAG 150 

TTGTACGAAG TGGTGCGGTT 200 

CTTGTACCGA AAGCAGAGAT 250 

TTTACAAGCA CGTTTAATGT 300 

TCAACAAATC AAAAACAATT 350 

374 



15 2) INFORMATION FOR SEQ ID NO: 1761 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) v TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1761 



GGAAAGTTCA GGTAAAACAA CGGTTTCCTT ACATGCGATT GCAGAAGTGC 50 

AACGTCAAGG TGGACAAGCG GCATTTATTG ATGCGGAGCA TGCGATGGAT 100 

CCTGTATATG CACAAAAGTT AGGTGTTAAT ATTGATGAGT TACTATTATC 150 

GCAGCCTGAT ACAGGAGAAC AAGGTTTAGA AATCGCAGAA GCATTAGTAC 200 

35 GAAGCGGTGC GATTGATATC ATTGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCAAAAGCAG AAATCGAAGG GGAAATGGGT GACTCCCACG TTGGTTTACA 300 

AGCGCGTTTA ATGTCACAAG CACTTCGTAA GCTTTCTGGT GCGATTAACA 350 

AATCAAAAAC AATTGCAATC TTCATTAACC A 381 



2) INFORMATION FOR SEQ ID NO: 1762 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
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(B) STRAIN: HER 1410 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1762 



TGAAAGTTCA GGTAAAACGA 
AACGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAATT 
ACAGCCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGACATT 
CCGAAAGCAG AGATTGAAGG 
AGCACGTTTA ATGTCACAAG 
AATCAAAAAC AATTGCAATC 



CAGTTTCATT ACATGCAATT GCAGAAGTAC 
GCATTCATCG ATGCGGAGCA CGCAATGGAT 
AGGCGTTAAC ATAGATGAAT TACTATTATC 
AAGGATTAGA AATCGCGGAA GCACTTGTAC 
ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 
CGACATGGGT GACTCACACG TAGGTTTACA 
CACTTCGTAA GCTTTCAGGA GCAATCAACA 
TTTATTAACC AAATTCGT 



50 
100 
150 
200 
250 
300 
350 
388 



2) INFORMATION FOR SEQ ID NO: 1763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1763 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATTG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTGGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG CGATATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1764 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1764 

5 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCGG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT ATACGCCATG 100 

AACGTATTGG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTGGGTGAC GTCATCGGTA AATACCACCC TCATGGTGAT ACTGCCGTAT 200 

10 ACGACACCAT TGTACGTATG GCGCAGCCCT TCTCCCTGCG TTACATGCTG 250 

GTAGATGGCC AGGGTAACTT TGGTTCGGTC GACGGCGACT CCGCCGCAGC 300 

GATGCGTTAT ACGGAAATCC GTATGTCGAA GATCGCCCAT GAACTGATGG 350 

CCGACCTC 358 



15 



30 



45 



2 ) INFORMATION FOR SEQ ID NO: 1765 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1765 



TTAAGAACTC TTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG 50 

CTGCCGGATG TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTTTA 100 

35 CGCCATGAAC GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCAG 150 

CCCGTGTCGT TGGTGACGTA ATCGGTAAAT ACCACCCGCA CGGCGACTCC 200 

GCGGTATACG ACACCATCGT GCGTATGGCG CAGCCGTTCT CGCTGCGTTA 250 

CATGCTGGTG GACGGCCAGG GTAACTTTGG TTCCATCGAC GGCGACTCCG 300 

CCGCGGCGAT GCGTTATACC GAAATTCGTC TGGCGAAAAT CGCTCATGAG 350 

40 CTGATGGCCG ATCTT 365 



2) INFORMATION FOR SEQ ID NO: 1766 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1766 



AAGAGCTCGT ATCTGGATTA 
GCCGGATGTC CGAGATGGCC 
CCATGAACGT ATTGGGCAAT 
CGTGTCGTTG GTGACGTAAT 
CGTTTATGAC ACCATTGTAC 
TGCTGGTCGA TGGCCAGGGT 
GCAGCGATGC GTTATACGGA 



TGCGATGTCG GTCATTGTTG 
TGAAACCGGT ACACCGTCGC 
GACTGGAACA AAGCCTATAA 
CGGTAAATAC CACCCTCATG 
GTATGGCACA GCCATTCTCC 
AACTTCGGTT CTGTCGATGG 
AATCCGTATG TCGAAAATCG 



GCCGTGCGCT 
GTACTTTACG 
AAAATCCGCC 
GTGATACCGC 
TTGCGTTATA 
CGACTCCGCC 
CCCA 



50 
100 
150 
200 
250 
300 
344 



2) INFORMATION FOR SEQ ID NO: 1767 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27 336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1767 

AACTCTTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

GGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTATT GGGCAATGAC TGGAACAAAG CCTATAAAAA ATCAGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAC CCGCACGGCG ACTCCGCGGT 200 

ATACGACACC ATCGTGCGTA TGGCGCAGCC GTTCTCGCTG CGTTACATGC 250 

TGGTGGACGG CCAGGGTAAC TTTGGTTCCA TCGACGGCGA CTCCGCCGCG 300 

GCGATGCGTT ATACCGAAAT TCGTCTGGCG AAAATCGCTC ATGAG 345 



2) INFORMATION FOR SEQ ID NO: 1768 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1768 

5 

CTTATCTGGA TTATGCGATG TCGGTCATTG TTGGCCGTGC GCTGCCGGAT 50 

GTCCGAGATG GCCTGAAGCC GGTACACCGT CGCGTACTTT ACGCCATGAA 100 

CGTATTGGGC AATGACTGGA ACAAAGCCTA TAAAAAATCA GCCCGTGTCG 150 

TTGGTGACGT AATCGGTAAA TACCACCCGC ACGGCGACTC CGCGGTATAC 200 

10 GACACCATCG TGCGTATGGC GCAGCCGTTC TCGCTGCGTT ACATGCTGGT 250 

GGACGGCCAG GGTAACTTTG GTTCCATCGA CGGCGACTCC GCCGCGGCGA 300 

TGCGTTATAC CGAAATTCGT CTGGCGAAAA TCGCTCATGA GCTGATGGCC 350 

GATCTT 356 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1769 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 361 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1769 



TAAGAACTCT TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 50 

TGCCGGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 100 

35 GCCATGAACG TATTGGGCAA TGACTGGAAC AAAGCCTATA AAAAATCAGC 150 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCACCCGCAC GGCGACTCCG 200 

CGGTATACGA CACCATCGTG CGTATGGCGC AGCCGTTCTC GCTGCGTTAC 250 

ATGCTGGTGG ACGGCCAGGG TAACTTTGGT TCCATCGACG GCGACTCCGC 300 

CGCGGCGATG CGTTATACCG AAATTCGTCT GGCGAAAATC GCTCATGAGC 350 

40 TGATGGCCGA T 361 



45 



2) INFORMATION FOR SEQ ID NO: 1770 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13824 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1770 





TAAGAACTCT 


TATCTGGATT 


ATGCGATGTC 


GGTCATTGTT 


GGCCGTGCGC 


50 




TGCCGGATGT 


CCGAGATGGC 


CTGAAGCCGG 


TACACCGTCG 


CGTACTTTAC 


100 


10 


GCCATGAACG 


TATTGGGCAA 


TGACTGGAAC 


AAAGCCTATA 


AAAAATCAGC 


150 




CCGTGTCGTT 


GGTGACGTAA 


TCGGTAAATA 


CCACCCGCAC 


GGCGACTCCG 


200 




CGGTATACGA 


CACCATCGTG 


CGTATGGCGC 


AGCCGTTCTC 


GCTGCGTTAC 


250 




ATGCTGGTGG 


ACGGCCAGGG 


TAACTTTGGT 


TCCATCGACG 


GCGATTCCGC 


300 




CGCGGCGATG 


CGTTATACCG 


AAATTCGTCT ' 


GGCGAAAATC 


GCTCATGAGC 


350 


15 


TGATGGCCGA 


TCTTG 








365 



2) INFORMATION FOR SEQ ID NO: 1771 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella terrigena 

(B) STRAIN : ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1771 

35 TCGTATCTGG ATTATGCGAT GTCGGTCATT GTTGGCCGTG CGCTGCCGGA 50 

TGTCCGAGAT GGGTTGAAAC CGGTACACCG TCGCGTACTT TACGCCATGA 100 

ACGTATTGGG CAATGACTGG AACAAAGCCT ATAAAAAATC CGCCCGTGTC 150 

GTTGGTGACG TAATCGGTAA ATATCACCCT CACGGTGATA CCGCCGTTTA 200 

TGACACCATT GTACGTATGG CGCAGCCATT CTCCTTGCGT TATATGCTGG 250 

4 0 TCGATGGCCA GGGTAACTTC GGTTCTGTCG ATGGCGACTC CGCCGCAGCG 300 

ATGCGTTATA CGGAAATCCG TATGTCGAAA ATCGCCCACG AGCTGATGGC 350 

CGACCTC 357 



45 

2) INFORMATION FOR SEQ ID NO: 1772 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 968 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 

(B) STRAIN: ATCC 33152 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1772 



CAGTACACAG GCGAGTTCTT TTTGCGATGA 
AATAAGCCGT ATAAAAAATC TGCTCGTGTA 
ATATCATCCT CATGGGGATA CAGCTGTTTA 
CTCAGCCCTT TTCCATGCGT TATATGCTGA 
GGCTCTGTAG ATGGAGATGC TCCAGCTGCC 
AATGTCCAAA GTGGCGCATG CTTTACTGGC 
TTGATTTTAG TCCTAACTAT GATGAAACAG 
CCATCGAGAA TTCCCAATTT ACTAGTTAAT 
AGGGATGGCT ACTAATATTC CACCACATAA 
CATGTATTGC TTTAGTGGAT GAACCTGACA 
GAAATTATTC CTGGCCCTGA TTTTCCTACA 
TGCTGGAATT ATTGAAGGTT ATCGTACTGG 
GGGCACGCAC AGAAATTGAA ACGGATGAAA 
ATTATTCAGG AATTACCCTA TCAGGTGAAT 
TATTGCTGAA TTGGTAAGGG ACAAGAAAAT 
GAGATGAGTC AGACAAGCAA GGAATGAGAG 
AATGAAGTAG CAGATGTGGT ATTGAATAAC 
GCAAAATGTA TTCGGAATTA ATATGGTTGC 
GTACTTTGAA TTTGAAGCAA ATACTGGAAT 
GAGGTTGTTA CCAGACGC 



GTGAGTTAAG TAATGATTGG 50 

GTAGGGGATG TCATTGGTAA 100 

TGACACTATT GTTCGTATGG 150 

TTGATGGGCA GGGTAATTTT 200 

ATGCGTTACA CTGAAGTAAG 250 

TGATTTGGAT AAGGAAACCG 300 

AATTTGCTCC AGTGGTATTG 350 

GGCTCTTCCG GTATTGCGGT 4 00 

TCTTACCGAA GTAATCAATG 4 50 

CGAGTCTTGA AGATTTAATG 500 

GCCGCAATTA TTAATGGTCG 550 

AAAAGGGCGG GTTGTTATCA 600 

GTTCAGGCCG TCAGTCAATT 650 

AAAGCGCGTT TGATCGAGCG 700 

CGAAGGAATT TCCGGCTTGA 750 

TAGTCATTGA ATTAAAACGC 800 

CTGTTCGCTC ATACTCAAAT 850 

TCTGGTGGAT GGCCAACCGC 900 

ATTTTATAAA ACATCGAAGA 950 

968 



2) INFORMATION FOR SEQ ID NO: 1773 



ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

SEQUENCE DESCRIPTION: SEQ ID NO: 1773 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) 



(xi) 

CACCGCCGAG TGCTATTTGC GATGAATGTA 
ACCTTATAAA AAATCAGCCC GTGTTGTTGG 
ACCCGCACGG TGACAGTGCT GTCTATGAAA 
CCTTTTTCTA TGCGCTACAT GTTGGTTGAC 
AGTTGATGGT GACTCCGCGG CGGCTATGCG 
CGAAGATCGC CCATGAACTG CTGGCGGATT 



CTGGGAAACG ATTGGAATAA ' 50 

GGATGTAATC GGTAAATATC 100 

CGATTGTTCG TTTAGCACAG 150 

GGTCAGGGTA ACTTCGGGTC 200 

TTATACCGAA GTTCGTATGG 250 

TGGAAAAAGA GACGGTCGAC 300 
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TTTGTTCCTA ACTATGATGG AACAGAAAAT 
CCGTATTCCA AACTTGTTAG TTAATGGCTC 
TGGCAACGAA TATCCCTCCG CATAACCTCG 
CTTGCCTATG TTGATAATGA AGACATCACC 
5 TATTACCGGG CCTGATTTTC CGACTGCTGC 
GAATATTAGA TGCTTATCGT ACAGGGCGTG 
CAGGCTGATA TTGAAACTGA TGAGAAAACA 
GACAGAAATT CCTTATCAGG TGAATAAAGC 
CGGAGCTTGT AAAAGATAAA CGTATTGAAG 
10 GAGTCTGATA AAGACGGTAT GCGTATTGTT 
AGTCGGTGAA GTAGTATTAA ATCACCTCTT 
TCTCTTTTGG TATTAATATG GTTGCGCTTC 
TTGAACCTAA AAGAAATTAT CGCAGCCTTT 
GGTGACTCGC CGTACCA 

15 



PCT/CA00/01150 

ATACCGGCTG TTATGCCAAC 350 

TTCAGGTATT GCCGTTGGGA 400 

GTGAAGTTAT CGACGGTTGT 4 50 

ATAGAAGAAT TAATGGAATA 500 

GATTATTAAT GGTCGCAGAG 550 

GAAAGATTTA TATCCGTGCT 600 

GGTCGCGAAA CCATTATCGT 650 

CCGTTTAATT GAAAAAATTG 700 

GTATCAGCGG ATTACGTGAC 750 

GTTGAGATCA AACGTGATGC 800 

TTCACAAACT CAAATGCAAG 850 

ATCAAGGCCA ACCAAAATTA 900 

ATTCGCCATC GTCGTGAAGT 950 

967 



2) INFORMATION FOR SEQ ID NO: 1774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 
30 (B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1774 

CCAGTACACC GCAGAGTATT GTATGCGATG AATGTATTGG GAAATGATTG 50 

35 GAATAAACCC TATAAAAAAT CTGCCCGTAT TGTTGGGGAC GTCATCGGTA 100 

AATACCATCC ACATGGTGAT AGCGCTGTTT ACGAGACAAT CGTTCGTCTT 150 

GCTCAGCCTT TCTCAATGCG TTACATGCTG GTTGATGGTC AGGGAAACTT 200 

CGGTTCTGTT GACGGAGACT CCGCAGCGGC AATGCGTTAT ACGGAAATCC 250 

GTATGGCGAA AATTGCCCAT GAACTACTTG CTGACCTTGA AAAAGAAACC 300 

40 GTTGATTTCG TTCCTAACTA TGACGGAACA GAGCACATTC CTGAAGTTAT 350 

GCCAACGAAA ATCCCAAACC TTTTGGTTAA TGGGTCGTCA GGTATTGCTG 400 

TTGGGATGGC AACCAATATC CCACCTCACA ATTTAGGGGA GGTGATTAAT 450 

GGTTGTCTTG CCTATATAGA AGACGAAGAC ATCAGCATTG ATGGTTTAAT 500 

GGAACACATT CCAGGGCCTG ATTTCCCAAC CGCAGCTATT ATTAATGGCC 550 

45 GTCGTGGGAT TATTGATGCG TATCGCACAG GGCGTGGCAA GGTCTATATC 600 

CGTGCAAGCG CTGAAGTGGA AGTCGATGAG AAAAATGGTC GCGAAACCAT 650 

TATTGTCAGC GAAATTCCTT ATCAAGTGAA TAAAGCTCGC TTGATTGAAA 700 

AAATTGCTGA GTTAGTTAAA GACAAGCGTG TTGAAGGTAT CAGTGCACTG 750 

CGTGACGAGT CTGATAAAGA CGGTATGCGT ATTGTTATTG AAATCAAACG 800 

50 CGATGCGGTG GGTGAAGTTG TACTGAACAA CTTATATTCC CTGACCCAAT 850 

TGCAAGTTTC TTTTGGTATC AATATGGTAG CTCTACACCA AGGGCAGCCG 900 

AAAATACTGA ATTTAAAAGA TATCATTGCT GCTTTTGTGC GTCACCGCCG 950 

TGAAGTCGTC ACTCGTCGTA CGATTTTC 97 8 

928 
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2) INFORMATION FOR SEQ ID NO: 1775 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1775 



CCAGTACACC 
GAATAAACCT 
AATATCACCC 

20 GCACAGCCTT 
CGGGTCAGTT 
GTATGGCGAA 
GTTGATTTTG 
GCCAACCCGT 

25 TCGGGATGGC 
GGCTGTCTTG 
GGAGCATATC 
GCAGAGGAAT 
CGTGCACAAG 

3 0 TATCGTGACG 
AAATTGCAGA 
CGTGATGAGT 
TGATGCTGTT 
TGCAGGTTTC 

35 AAAGTGTTAA 
TGAAGTGGTG 



GTCGCGTTCT 
TATAAAAAAT 
GCACGGTGAC 
TTTCTATGCG 
GATGGTGACT 
AATCGCCCAT 
TTCCTAACTA 
ATTCCAAACT 
AACAAACATT 
CTTATGTTGA 
ACGGGGCCTG 
TTTAGATGCT 
CTGATGTAGA 
GAAATTCCTT 
GCTTGTTAAA 
CAGATAAAGA 
GGTGAAGTTG 
TTTTGGTATT 
ACCTGAAAGA 
ACTCGTCGTA 



TTTCGCGATG 
CAGCCCGTGT 
AGTGCTGTTT 
TTACATGTTG 
CGGCGGCTGC 
GAACTGCTGG 
TGATGGAACA 
TATTAGTCAA 
CCTCCGCATA 
TAACGAAGAC 
ATTTCCCAAC 
TACCGTACTG 
AACCGATGAG 
ATCAGGTGAA 
GATAAACGTA 
TGGTATGCGC 
TACTAAATCA 
AACATGGTTG 
AATTATTTCA 
CTATTTTT 



AATGTATTAG 
TGTTGGGGAT 
ATGAAACGAT 
GTTGACGGGC 
AATGCGTTAT 
CGGATTTGGA 
GAGCATATCC 
TGGTTCTTCA 
ACCTAGGTGA 
ATCACTATTG 
TGCCGCTATT 
GGCGCGGAAA 
AAAACTGGTC 
CAAAGCTCGC 
TTGAAGGCAT 
ATTGTTGTTG 
CTTATTTTCT 
CACTGCATCA 
GCCTTTATTC 



GAAACGATTG 50 

GTTATCGGTA 100 

AGTTCGTTTA 150 

AGGGTAACTT 200 

ACCGAAGTTC 250 

AAAAGAAACG 300 

CGGCAGTCAT 350 

GGTATCGCAG 400 

AGTTATCGAC 450 

AAGAGTTGAT 500 

ATTAATGGCC 550 

AATTTATATT 600 

GCGAAACAAT 650 

TTAATTGAAA 700 

TAGCGGATTA 750 

AAATTAAGCG 800 

CAGACTCAGA 850 

AGGTCAACCG 900 

GTCACCGTCG 950 
978 



40 2) INFORMATION FOR SEQ ID NO: 1776 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1776 

TGGGTAATGA CTGGAATAAG CCATACAAAA AATCGGCCCG TGTAGTCGGG 50 

GACGTTATCG GTAAATATCA CCCGCATGGT GACAGCGCGG TCTACGACAC 100 

5 AATTGTGCGT ATGGCCCAGC CGTTCTCACT GCGCTATATG CTGGTGGATG 150 

GGCAGGGCAA CTTCGGTTCC GTTGATGGCG ACTCCGCCGC AGCGATGCGT 200 

TATACCGAAA TCCGTATGTC TAAAATTGCT CACGAATTGT TGGCGGACTT 250 

AGAAAAAGAT ACCGTCGACT TCGTGCCGAA CTATGACGGT ACGGAGCAAA 300 

TTCCTGCCGT AATGCCAACC CGAATCCCTA ACTTGCTGGT TAACGGCTCG 350 

10 TCAGGTATTG CTGTCGGTAT GGCAACCAAT ATTCCGCCGC ATAACCTTTC 400 

TGAGGTTATT GATGGCTGTC TGGCCTATAT CGAAGATGAA AACATCACCA 4 50 

TTGAAGGGTT GATGGAGTAC ATCCCGGGGC CAGATTTCCC AACTGCTGCG 500 

ATTATCAATG GTCGCCGTGG TATTGAAGAA GCTTATCGTA CTGGCCGTGG 550 

CAAGGTGTAT ATCCGTGCCC GTGCTGAAGT' TGAGGCTGAC GCTTAAACCG 600 

15 GTCGCGAAAC CATTATTGTT CACGAGATCC CGTATCAGGT GAACAAGGCG 650 

CGGTTGATTG AAAAAATCGC CGAGCTGGTT AAAGAAAAAC GCGTAGAAGG 700 

CATCAGTGCG TTGCGTGATG AGTCTGATAA AGACGGCATG CGTATCGTGA 750 

TTGAAATCAA ACGTGATGCT GTCGGGGAAG TGGTTCTGAA CAACCTCTAT 800 

TCTCTGACGC AATTGCAGGT GACTTTCGGT ATCAATATGG TGGCTCTGTC 850 

20 TCAAGGGCAG CCTAAGTTGC TTAACCTGAA AGACATTTTG GTTGCTTTCG 900 

TGCGCCACCG CCGTGAAGTG GTGACTCGCC GTACCATTTT 940 



25 2) INFORMATION FOR SEQ ID NO: 1777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



4 0. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1777 



CCGTTTATTG GCGATGGCCT GAAGCCGGTC CAGCGTCGCA TCGTCTATGC 50 

GATGTCTGAA CTGGGTCTGA ACGCCAGCGC GAAGTTCAAA AAGTCCGCCC 100 

GCACCGTCGG TGACGTGCTG GGTAAATACC ATCCCCACGG CGACAGCGCG 150 

TGCTATGAAG CCATGGTGCT GATGGCTCAG CCCTTCTCCT ACCGCTATCC 200 

4 5 GCTGGTTGAC GGTCAGGGAA ACTGGGGGGC GCCGGACGAT CCTAAATCCT 250 

TCGCCGCAAT GCGTTATACC GAATCCCGTT TGTCGAAGTA TGCTGAACTG 300 

CTGCTGAGCG AACTGGGGCA AGGCACCGTT GACTGGGTAC CAAACTTCGA 350 

CGGCACTTTG CAGGAGCCGA AGATGCTGCC TGCGCGCCTG CCCAATATTC 4 00 

TGCTAAACGG TACTACCGGC ATTGCCGTTG GGATGGCGAC GGACATTCCG 4 50 

50 CCGCACAACC TGCGTGAAGT GGCCCGGGCG GCCATTACCC TGATTGAAAA 500 

GCCGCAAACT TCGCTGGATG ACCTGCTGGA TATCGTGCAG GGGCCGGATT 550 

ATCCTACCGA AGCGGAAATC ATTACCCCCC GTGCCGAAAT CCGCAAAATC 600 

TACCAGAATG GCCGCGGTTC GGTGCGGATG CGCGCGGTAT GGGCCAAAGA 650 

930 
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5 2) INFORMATION FOR SEQ ID NO: 1778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM:^ Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1778 



GTTTATTGGC GATGGCCTGA AGCCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCTGAACT GGGTCTGAAC GCCAGCGCGA AGTTCAAAAA GTCCGCCCGC 100 

ACCGTCGGTG ACGTGCTGGG TAAATACCAT CCCCACGGCG ACAGCGCGTG 150 

CTATGAAGCC ATGGTGCTGA TGGCTCAGCC CTTCTCCTAC CGCTATCCGC 200 

2 5 TGGTTGACGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC TAAATCCTTC 250 

GCCGCAATGC GTTATACCGA ATCCCGTTTG TCGAAGTATG CTGAACTGCT 300 

GCTGAGCGAA CTGGGGCAAG GCACCGTTGA CTGGGTACCA AACTTCGACG 350 

GCACTTTGCA GGAGCCGAAG ATGCTGCCTG CGCGCCTGCC CAATATTCTG 400 

CTAAACGGTA CTACCGGCAT TGCCGTTGGG ATGGCGACGG ACATTCCGCC 4 50 

30 GCACAACCTG CGTGAAGTGG CCCGGGCGGC CATTACCCTG ATTGAAAAGC 500 

CGCAAACTTC GCTGGATGAC CTGCTGGATA TCGTGCAGGG GCCGGATTAT 550 

CCTACCGAAG CGGAAATCAT TACCCCCCGT GCCGAAATCC GCAAAATCTA 600 

CCAGAATGGC CGCGGTTCGG TGCGGATGCG CGCGGTATGG GCCAAAGAAG 650 

ACGGCGCGGT GGTGATTAGC GCGCTGCCGC ATCAGGTTTC CGGCGCCCGC 700 

35 GTGCTTGAGC AGAT 714 



2) INFORMATION FOR SEQ ID NO: 1779 



40 



(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 722 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 ( 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1779 

931 
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TTGGCGATGG CTTAAAACCG GTCCAGCGTC GCATCGTCTA TGCGATGTCC 50 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC AAAAAGTCCG CCCGCACCGT 100 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA CGGCGACAGC GCCTGCTATG 150 

5 AAGCGATGGT GCTGATGGCG CAGCCGTTCT CTTACCGCTA TCCGCTGGTG 200 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC GATCCCAAAT CTTTCGCCGC 250 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA GTATGCCGAG CTGCTGCTCA 300 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG TGCCAAACTT TGACGGTACG 350 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT TTGCCGAACA TCCTGCTGAA 4 00 

10 CGGCACCACC GGCATCGCGG TAGGTATGGC GACCGATATT CCTCCGCACA 4 50 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA CGCTGATTGA GCAGCCGAAA 500 

ACCACCCTCG ACGAACTGCT GGATATCGTA CAGGGGCCGG ATTTCCCGAC 550 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA AATTCGAAAA ATCTACCAGA 600 

ACGGGCGCGG CTCAGTGCGC ATGCGCGCGG TGTGGAGTAA AGAGGACGGC 650 

15 GCGGTGGTGA TCAGCGCGCT GCCGCATCAG GTCTCCGGCG CCAAAGTGCT 700 

GGAGCAGATT GCGGCGCAGA TG 722 



20 2) INFORMATION FOR SEQ ID NO: 1780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1780 



GTCCAGCGTC GCATCGTTTA TGCGATGTCT GAGCTGGGGC TGAACGCCAG 50 

CGCGAAGTTC AAAAAGTCCG CCCGCACCGT GGGTGATGTG CTGGGTAAAT 100 

ATCACCCGCA CGGCGATAGC GCATGCTATG AAGCTATGGT GCTGATGGCG 150 

CAGCCATTCT CCTACCGCTA CCCGCTGGTG GATGGGCAGG GGAACTGGGG 200 

4 0 GGCACCGGAC GATCCTAAAT CCTTCGCCGC GATGCGTTAT ACCGAATCCC 250 

GTTTGTCGAA GTATGCGGAA CTGCTGCTGG GCGAACTGGG GCAGGGAACC 300 

GTCGACTGGG TGCCGAACTT CGACGGGACG ATGCAGGAGC CGAAAATGCT 350 

GCCTGCGCGT CTGCCGAATA TTCTGCTGAA CGGCACTACC GGCATCGCCG 400 

TCGGTATGGC AACCGATATT CCTCCGCACA ACCTGCGTGA AGTGGCGCAG 4 50 

4 5 GCGGCGATTA CCCTGATCGA AAAACCGCAG ACCTCGCTCG ACGAACTGCT 500 

GGATATCGTT CACGGACCCG ACTACCCGAC CGAAGCCGAA ATCATTACTC 550 

CACGCGCGGA GATCCGCAAA ATCTACCAGA ACGGCCGCGG TTCGGTGCGG 600 

ATGCGCGCGG TATGGAAAAA AGAGGACGGC GCGGTCGTGA TTACGGCGTT 650 

GCCGCATCAG GTTTCCGGCG CCCGCGTGCT GGAGCAAATT GC 692 

50 



2) INFORMATION FOR SEQ ID NO: 1781 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1781 

GTTTATTGGC GATGGCTTAA AACCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCCGAGCT GGGGCTGAAC GCCAGCGCGA AATTCAAAAA GTCCGCCCGC 100 

ACCGTCGGCG ACGTGTTGGG TAAATATCAC CCGCACGGCG ACAGCGCCTG 150 

CTATGAAGCG ATGGTGCTGA TGGCGCAGCC GTTCTCTTAC CGCTATCCGC 200 

TGGTGGATGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC CAAATCTTTC 250 

GCCGCCATGC GTTACACCGA ATCCCGCCTG TCGAAGTATG CCGAGCTGCT 300 

GCTCAGCGAG CTGGGGCAGG GGACGGTCGA CTGGGTGCCA AACTTTGACG 350 

GCACGCTGCA GGAGCCGAAA ATGCTGCCAG CGCGTTTGCC GAACATCCTG 400 

CTGAACGGCA CCACCGGCAT CGCGGTAGGC ATGGCGACCG ATATTCCTCC 4 50 

GCACAACCTG CGGGAAGTGG CCAAAGCGGC GATTACGCTG ATTGAGCAGC 500 

CGAAAACCAC CCTCGACGAA CTGCTGGATA TCGTACAGGG GCCGGATTTC 550 

CCGACCGAGG CGGAGATCAT CACCTCGCGG GCGGAAATTC GCAAAATCTA 600 

CCAAAACGGG CGCGGCTCAG TGCGCATGCG CGCGGTGTGG AGTAAAGAGG 650 

ACGGCGCGGT GGTGATCAGC GCGCTGCCGC ATCAGGTCTC CGGCGCCAAA 700 



2) INFORMATION FOR SEQ ID NO: 1782 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1782 

ATTGGCGATG GCTTAAAACC GGTCCAGCGT CGCATCGTCT ATGCGATGTC 50 

CGAGCTGGGG CTGAACGCCA GCGCGAAATT CAAAAAGTCC GCCCGCACCG 100 

TCGGCGACGT GTTGGGTAAA TATCACCCGC ACGGCGACAG CGCCTGCTAT 150 

GAAGCGATGG TGCTGATGGC GCAGCCGTTC TCTTACCGCT ATCCGCTGGT 200 

GGATGGTCAG GGAAACTGGG GGGCGCCGGA CGATCCCAAA TCTTTCGCCG 250 

933 
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CCATGCGTTA CACCGAATCC CGCCTGTCGA 
AGCGAGCTGG GGCAGGGGAC GGTCGACTGG 
GCTGCAGGAG CCGAAAATGC TGCCAGCGCG 
ACGGCACCAC CGGCATCGCG GTAGGCATGG 
5 AACCTGCGGG AAGTGGCCAA AGCGGCGATT 
AACCACCCTC GACGAACTGC TGGATATCGT 
CCGAGGCGGA GATCATCACC TCGCGGGCGG 
AACGGGCGCG GCTCAGTGCG CATGCGCGCG 
CGCGGTGGTG ATCAGTGCGC TGCCGCATCA 
10 TGGAGCAGAT TGCGGCGCAG ATGCGC 



PCT/CAOO/01150 

AGTATGCCGA GCTGCTGCTC 300 

GTGCCAAACT TTGACGGCAC 350 

TCTGCCGAAC ATCCTGCTGA 400 

CGACCGATAT TCCTCCGCAC 450 

ACGCTGATTG AGCAGCCGAA 500 

ACAGGGGCCG GATTTCCCGA 550 

AAATTCGCAA AATCTACCAG 600 

GTGTGGAGTA AAGAGGACGG 650 

GGTCTCTGGC GCCAAAGTGC 700 

726 



2) INFORMATION FOR SEQ ID NO: 1783 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1783 

30 GGCTTAAAAC CGGTCCAGCG TCGCATCGTC TATGCGATGT CCGAGCTGGG 50 

GCTGAACGCC AGCGCGAAAT TCAAAAAGTC CGCCCGCACC GTCGGCGACG 100 

TGTTGGGTAA ATATCACCCG CACGGCGACA GCGCCTGCTA TGAAGCGATG 150 

GTGCTGATGG CGCAGCCGTT CTCTTACCGC TATCCGCTGG TGGATGGTCA 200 

GGGAAACTGG GGGGCGCCGG ACGATCCCAA ATCTTTTGCC GCCATGCGTT 250 

35 ACACCGAATC CCGCCTGTCG AAGTATGCCG AGCTGCTGCT CAGCGAGCTG 300 

GGGCAGGGGA CGGTCGACTG GGTGCCAAAC TTTGACGGCA CGCTGCAGGA 350 

GCCGAAAATG CTGCCAGCGC GTCTGCCGAA CATCCTGCTG AACGGCACCA 400 

CCGGCATCGC GGTAGGCATG GCGACCGATA TTCCTCCGCA CAACCTGCGG 450 

GAAGTGGCCA AAGCGGCGAT TACGCTGATT GAGCAGCCGA AAACCACCCT 500 

4 0 CGACGAACTG CTGGATATCG TACAGGGGCC GGATTTCCCG ACCGAGGCGG 550 

AGATCATCAC CTCGCGGGCG GAAATTCGCA AAATCTACCA GAACGGGCGC 600 

GGCTCAGTGC GCATGCGCGC GGTGTGGAGT AAAGAGGACG GCGCGGTGGT 650 

GATCAGTGCG CTGCCGCATC AGGTCTCCGG CGCCAAAGTG CTGGAGCAGA 700 

TTGCGG 706 

45 



2) INFORMATION FOR SEQ ID NO: 1784 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

934 
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(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1784 

TTGGCGATGG CTTAAAACCG GTCCAGCGTC GCATCGTCTA TGCGATGTCC 50 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC AAAAAGTCCG CCCGCACCGT 100 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA CGGCGACAGC GCCTGCTATG 150 

15 AAGCGATGGT GCTGATGGCG CAGCCGTTCT CTTACCGCTA TCCGCTGGTG 200 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC GATCCCAAAT CTTTCGCCGC 250 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA GTATGCCGAG CTGCTGCTCA 300 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG TGCCAAACTT TGACGGCACG 350 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT CTGCCGAACA TCCTGCTGAA 400 

20 CGGCACCACC GGCATCGCGG TAGGCATGGC GACCGATATT CCTCCGCACA 450 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA CGCTGATTGA GCAGCCGAAA 500 

ACCACCCTCG ACGAACTGCT GGATATCGTA CAGGGGCCGG ATTTCCCGAC 550 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA AATTCGCAAA ATCTACCAGA 600 

ACGGGCGCGG CTCA 614 

25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1785 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella terrigena 
40 (B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1785 



GCCTGAAACC 
45 CTGAACGCCA 
GCTGGGTAAA 
TGCTGATGGC 
GGCAACTGGG 
TACCGAATCC 
50 GTCAGGGAAC 
CCGAAAATGC 
CGGTATCGCC 
AAGTGGCCAA 



GGTCCAGCGG 
CCGCTAAATT 
TATCACCCGC 
GCAGCCGTTC 
GCGCCCCGGA 
CGCCTGTCAA 
CGTTGACTGG 
TGCCTGCGCG 
GTGGGGATGG 
AGCGGCCATC 



CGCATCGTTT 
CAAAAAATCC 
ACGGCGATAG 
TCTTACCGCT 
CGATCCCAAA 
AGTATGCGGA 
GTACCTAACT 
TTTGCCGAAT 
CCACCGATAT 
ACCCTGATTG 



ATGCGATGTC 
GCGCGCACCG 
CGCCTGCTAT 
ACCCGCTGGT 
TCCTTCGCCG 
GCTGCTGCTG 
TTGACGGTAC 
ATTCTGCTCA 
TCCGCCGCAC 
AAAAGCCGCA 



CGAACTGGGT 50 

TCGGCGACGT 100 

GAGGCGATGG 150 

GGACGGTCAG 200 

CGATGCGTTA 250 

GGCGAGCTGG 300 

GATGCAGGAG 350 

ACGGCACCAC 400 

AACCTGCGCG 450 

GACCTCGCTC 500 
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GACGAACTGC TGGATATCGT TCACGGGCCG GACTACCCCA CCGAAGCTGA 
AATCATCACC CCGCGCGCCG AGATCCGCAA AATCTATCAG AACGGTCGCG 
GCTCGGTTCG CATGCGTGCG GTGTGGAAAA AAGAGGACGG CGCGGTGGTG 
ATTAGCGCCC TGCCGCAT 



550 
600 
650 
668 



2) INFORMATION FOR SEQ ID NO: 1786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double, 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1786 

CATTACGTTC TAACACTCAA GGACGCGGAA CATTCTCTAT GGTGTTTGAC 50 
CACTATGAAG AAGTACCAAA GTCTGTTTCT GAAGAAATTA TCAAAAAAAA 100 
TAAAGGTGAA TAA 113 



2) INFORMATION FOR SEQ ID NO: 1787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14 579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1787 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 
TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 1788 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1788 

15 GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG' TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1789 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 118 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1789 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 

TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 

AAAAATAAAG GTGAATAA 118 

40 



2) INFORMATION FOR SEQ ID NO: 1790 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1790 



GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 1791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1791 

GCAACGTCAT TACGTTCTAA CACGCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1792 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1792 

CGTCATTACG TTCTAACACT CAAGGACGCG GAACATTCTC TATGGTGTTT 50 
GACCACTATG AAGAAGTACC AAAGTCTGTT TCTGAAGAAA TTATCAAAAA 100 
AAATAAAGGT GAATAA 116 
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2) INFORMATION FOR SEQ ID NO: 1793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 93 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 

GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 

AAAAAAATAA AGGTGAATAA 120 

20 



2) INFORMATION FOR SEQ ID NO: 1794 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
35 (B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1794 

CACTCAAGGA CGCGGAACAT TCTCTATGGT GTTTGATCAC TATGAAGAAG 50 
4 0 TACCAAAGTC TGTTTCTGAA GAAATTATCA AAAAAAATAA AGGTGAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1795 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1795 



GCAACGGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCTGAA GAAATTATTA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 179,6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1796 

GCGACAGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATTA 100 
AAAAAAATAA AGGCGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1797 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 
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2 ) INFORMATION FOR SEQ ID NO: 1798 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1798 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGAGACGCAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCCATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1799 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1800 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 
CTAAGGAGGA ATTTAGA 



50 
100 
117 



2 ) INFORMATION FOR SEQ ID NO: 1801 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1801 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACTTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1802 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1802 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
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CTAAGGAGGA ATTTAGA 117 

5 2) INFORMATION FOR SEQ ID NO: 1803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM:* Bacillus cereus 

(B) STRAIN: ATCC 7064 



20 



25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1803 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1804 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM Bacillus cereus 

(B) STRAIN: ATCC 49064 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1804 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCACTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
4 5 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1805 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1814 



10 


ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 




AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 




CAGTACTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 




GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT , 


ATCACAATCT 


CAACTGCACA 


200 




CGTTGAGTAC 


GAAACTGAAA 


ctcgtcacta' 


TGCACACGTT 


GACTGCCCAG 


250 


15 


GTCACGCTGA 


CTATGT 








266 



2) INFORMATION FOR SEQ ID NO: 1815 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1815 



35 


ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 




AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 




CAGTACTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 




GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT 


CAACTGCACA 


200 




CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


40 


GTCACGCTGA 


CTATGTTAA 








269 



45 



2) INFORMATION FOR SEQ ID NO: 1816 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1805 

TTGATTTTTA TCAATTGTTC GAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCGCTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1806 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1806 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN : ATCC 15816 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1807 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

5 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1808 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1808 

25 TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1809 



(i) SEQUENCE CHARACTERISTICS: 
(A) v LENGTH: 117 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1809 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAG 117 

50 



2) INFORMATION FOR SEQ ID NO: 1810 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1810 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

20 CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT " GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



25 2) INFORMATION FOR SEQ ID NO: 1811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 
30. (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1811 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


CAGTTCTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT 


CAACTGCACA 


200 


45 CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCACGCTGA 


CTATGTTAAA 


AACATGAT 






278 



50 2) INFORMATION FOR SEQ ID NO: 1812 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 270 bases 
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10 



20 



45 



(B) TYPE: Nucleic' acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1812 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


15 CAGTACTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT 


CAACTGCACA 


200 


CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCACGCTGA 


CTATGTTAAA 








270 



2) INFORMATION FOR SEQ ID NO: 1813 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1813 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


40 CAGTTCTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


GACGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGA 


ATCACAATCT 


CAACTGCACA 


200 


CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCATGCTGA 


CTATGTTAAA 


AACATGAT 






278 



2) INFORMATION FOR SEQ ID NO: 1814 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 266 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1816 



ATGGCTAAAG CTAAATTCGA 
AATCGGCCAC GTTGACCATG 
CAGTTCTTGC AAAAGCTGGT 
GATGCTGCTC CAGAAGAAAG 
CGTTGAGTAC GAAACTGAAA 
GTCACGCTGA CTATGTTA 



ACGTTCTAAA CCCCATGTTA ACATCGGTAC 
GTAAAACTAC ATTAACTGCT GCGATCACTA 
GGTGCTGAAG CACGCGGATA CGATCAAATC 
AGAGCGCGGT ATCACAATCT CAACTGCACA 
CTCGTCACTA TGCACACGTT GACTGCCCAG 



50 
100 
150 
200 
250 
268 



2) INFORMATION FOR SEQ ID NO: 1817 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1817 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1818 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 . 
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(xi) SEQUENCE DESCRIPTION: SEQ 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
5 AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTA 
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ID NO: 1818 

CCCCATGTTA ACATCGGTAC 50 

ATTAACTGCT GCGATCACTA 100 

CACGCGGATA CGATCAAATC 150 

ATCACAATCT CAACTGCACA 200 

TGCACACGTT GACTGCCCAG 250 

268 



2) INFORMATION FOR SEQ ID NO: 1819 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 
25 (B) STRAIN: CIP 9444 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1819 



ATGGCTAAAG 


CTAAATTCGA 


ACGTTCTAAA 


CCCCATGTTA 


ACATCGGTAC 


50 


30 AATCGGCCAC 


GTTGACCATG 


GTAAAACTAC 


ATTAACTGCT 


GCGATCACTA 


100 


CAGTACTTGC 


AAAAGCTGGT 


GGTGCTGAAG 


CACGCGGATA 


CGATCAAATC 


150 


GATGCTGCTC 


CAGAAGAAAG 


AGAGCGCGGT 


ATCACAATCT 


CAACTGCACA 


200 


CGTTGAGTAC 


GAAACTGAAA 


CTCGTCACTA 


TGCACACGTT 


GACTGCCCAG 


250 


GTCACGCTGA 


CTATGTTAAA 


AACATGAT 






278 



35 



2) INFORMATION FOR SEQ ID NO: 1820 



4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 
50 (B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1820 
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ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTAAA AACATGAT 



PCT/CAOO/01150 

CCCCATGTTA ACATCGGTAC 50 
ATTAACTGCT GCGATCACTA 100 
CACGCGGATA CGACCAAATC 150 
ATCACAATCT CAACTGCACA .200 
TGCACACGTT GACTGCCCAG 250 

278 



2) INFORMATION FOR SEQ ID NO: 1821 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 bases 

(B) TYPE: • Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1821 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTA 263 



2) INFORMATION FOR SEQ ID NO: 1822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1822 

CAATCGAAGT ACAACGTTCT CTTCGWGTAT TGGACGGTGC GGTTACTGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TACGGAGTTC CACGTATCGT ATTTGCTAAC AAAATGGACA 150 
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AAATCGGTGC TGACTTCCTT TACTCAGTAA GCACACTTCA CGACCGTCTT 200 

CAAGCAAACG CACACCCAAT CCAATTGCCA ATCGGTGCTG AAGATGACTT 250 

CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 300 

ACCTTGGTAC AGATATCCTT GAAGAAGATA TTCCAGCTGA ATACCTTGAC 350 

5 CAAGCTCAAG AATACCGTGA AAAATTGGTT GAAGCAGTCG CTGAAACTGA 400 

TGAAGACTTG ATGATGAAAT ACCTTGAAGG TGAAGAAATC ACTAACGAAG 4 50 

AATTGAAAGC TGCTATCCGT AAAGCAACTA TCAACGTTGA ATTCTTCCCA 500 

GTATTGTGTG GTTCTGCCTT CAAGAACAAG GGTGTTCAAT TGATGCTTGA 550 

TGCGGTTATC GACTACCTTC CAAGCCCACT TGATATCCCA GCGATCAAAG 600 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGACGAA 650 

GAGCCATTCG CAGCTCTTGC CTTCAAGATC ATGACGGACC CATTTGTAGG 700 

TCGTTTGACA TTCTTCCGTG TATACTCARG TGTTCTCCAA TCARGKTCTT 750 

ACGTATTGAA CACATCTAAA GGTAAACGTG AACGTATCGG ACGTATCCTT 800 

CAAATGCACG CTAACAGCCG TCAAGAAATT GACACTGTTT ACTCAGGTGA 850 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TYCAACTGGT GACTCATTGM 900 

CAGATGAAAA AGCTAAAATC ATCCTTGAGT CAATCAACGT TCCAGAMCCA 950 

GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGATAAGAT 1000 

GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 1050 

AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCAGGTAT GGGTGAGCTT 1100 

2 0 CACTTGGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTGGAAGC 1150 

GAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 1200 

CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 1250 

GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 1300 

ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 1350 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT TCTTGCAGGT 1400 

TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 1450 

TGTCGACTCA TCTGAAACTG CCTTCAAGAT CGCGGCTTCA CTTGCCCTTA 1500 

AAGAAGCTGC TAAGTCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 1550 

GTAACCATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA TGGGTCACGT 1600 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 1650 

AAATCGTTCG TGCTTACG 1668 



35 2) INFORMATION FOR SEQ ID NO: 1823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1823 

AGACCTGCGT TCACAAACAC AGGGTCGTGC TTCTTACTCT ATGGAGTTCT 50 
TGAAGTACAA CGAAGCGCCA AACAACGTTG CTACAGCAAT CATTGAAGCT 100 
CGTAAGGCTA GATAA 115 
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2 ) INFORMATION FOR SEQ ID NO: 1824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1824 

CTGCGTTCAC TGACCAAGGT CGTGCATCTT ACTCCATGGA ATTCCTGAAG 50 
TATGATGACG CGCCAAACAA CGTAGCTCAG GCCGTAATCG AAGCTCGCGG 100 
TAAATAA 107 



2) INFORMATION FOR SEQ ID NO: 1825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1825 

TTACTCCATG GAGTTCCTGA AGTATGATGA TGCGCCGAAC AACGTTGCTC 50 
AGGCCGTAAT CGAAGCCCGT GGTAAATAA 79 



2) INFORMATION FOR SEQ ID NO: 1826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

5. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1826 

CAGCTGCGTT CTCTGACCAA AGGTCGTGCA TC AT AC ACT A TGGAATTCCT 50 
GAAGTATGAT GATGCGCCAA ACAACGTTGC TCAGGCCGTT ATTGAAGCCC 100 
GTGGTAAGTA A 111 



2) INFORMATION FOR SEQ ID NO:' 1827 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 
25 (B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1827 

GATTTGCGCT CTGCAACTCA TGGGCGTGCT TCGTACTCCA TGGAGTTCTT 50 
30 GAAGTACTCT GATGCACCGC AAAACATTGC GAAAGCGATT ATTGAATCTC 100 
GTAGCTAA' 108 



35 2) INFORMATION FOR SEQ ID NO: 1828 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesvmbacterivm proteus 

(B) STRAIN: ATCC 12841 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1828 

CTCAGCTGCG TTCTCTGACC AAAGGTCGTG CATCTTACTC CATGGAATTC 50 
CTGAAGTATG ATGATGCGCC TAACAACGTT GCTCAGGCCG TTATTGAAGC 100 
TCGTGGCAAA TAA 113 
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2) INFORMATION FOR SEQ ID NO: 1829 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1829 

GCCGCAGGGT TAAAACCAAA GTCCCGTGCT CTCTCCTGAA GGGGAGAGCA 50 
CTATAGTAAG GAATATAGCC 70 



2) INFORMATION FOR SEQ ID NO: 1830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1830 

GCCTCGGGTA AAACTTATAT CCCAGTCCCC CTCGTATAGA GGGGGATAGA 50 
GTAAAGGAAG ATAATC 66 



2) INFORMATION FOR SEQ ID NO: 1831 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1831 



TCCACAGGAT TAAAACCCAG GTTTAAACCT AAGTCCCGTG CTCTCTCCTC 
AGGGGAGAGC ACAATAGTAA GGAATATAGC C 



50 
81 



2) INFORMATION FOR SEQ ID NO: 1832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN : ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1832 

GCTACTAGTT TAAAACATTG ATCCCGTGCT CTCTCTATGA AGGGAGAGCA 50 
CAAGAGTAAG GAATAAAGCC 70 



2) INFORMATION FOR SEQ ID NO: 1833 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1833 

TTTCCAGTTA CGACATAAAT GTTATTATGG TCCAGCTTTG ACTGGACTAT 50 
TCTGAAAAGA AAGGAATATA TC 72 



2) INFORMATION FOR SEQ ID NO: 1834 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1834 

GCCCCGGGTT TTAAAAAACA TTGATCCCGT GCTCTCTCCA GAAGGGGAGA 50 
GCGCAACAGT AAGGAATATA GCC 73 



2) INFORMATION FOR SEQ ID NO: 1835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

(xi) SEQUENCE DESCRIPTION: SEQ IDvNO: 1835 

CTGCAGCTGA TGGTCCTATG CCACAAACTA GAGAGCACAT CCTTCTATCA 50 

CGCCAAGTAG GTGTTCCATA TATCGTTGTA TTTATGAATA AAGCAGATAT 100 

GGTTGATGAT GCTGAACTTT TAGAATTGGT TGAAATGGAA ATTAGAGAAT 150 

TATTAAGCTC TTATGATTTC CCAGGTGATG ACACACCTAT TATTTCAGGT 200 

TCTGCTTTAA AAGCTCTTGA AGAAGCAAAA GCTGGACAAG ATGGCGAATG 250 

GTCAGCGAAA ATTATGGATC TTATGGCTGC TGTTGATAGC TATATTCCAA 300 

CTCCAACTCG TGACACTGAA AAAGATTTCT TAATGCCAAT TGAAGATGTT • 350 

TTCTCAATTT CAGGTCGTGG TACTGTTGTT ACAGGTAGAA TTGAAAAAGG 400 

TATTGTAAAA GTTGGTGATA CTATAGAAAT CGTTGGTATT AAAGATACTC 450 

AAACAACAAC TGTAACTGGC GTTGAAATGT TTAGAAAAGA AATGGACCAA 500 

GGTGAAGCAG GGGATAATGT TGGTGTTCTT CTTCGTGGTA CAAAAAAAGA 550 

AGAAGTTATC CGCGGTATGG TTCTTGCTAA ACCAAAATCA ATTACTCCAC 600 

ATACTGATTT CGAAGCTGAA GTTTATATCC TAAATAAAGA TGAGGGTGGT 650 

AGACATACTC CATTCTTTAA TAACTATAGA CCGCAATTCT ATGTAAGAAC 700 

AACAGATGTA ACAGGTTCTA TTAAATTAGC TGATGGCGTT GAAATGGTTA 750 

TGCCTGGTGA AAATGTAAGA ATTACTGTAA GCTTGATTGC ACCAG 7 95 
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2) INFORMATION FOR SEQ ID NO: 1836 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1836 

GCCATACTAG TTGTTTCTGC AGCTGATGGC CCAATGCCAC AAACTAGAGA 50 

GCACATTTTG CTATCTCGTC AAGTTGGTGT TCCATATATA GTTGTTTTTA 100 

TGAACAAAGC TGATATGGTA GATGACGCAG AGTTGCTAGA ATTAGTTGAA 150 

20 ATGGAGATCA GAGAGTTATT AAGCGAATAT GACTTCCCTG GTGATGATAC 200 

TCCTATTATA AGCGGATCAG CACTTCAAGC TCTTGAAGAA GCTAAAGCTG 250 

GTAATGATGG CGAATGGTCA GCTAAGATTA TGGATCTTAT GGCTGCTGTT 300 

GATAGCTACA TACCAACTCC AGTTCGTGCT ACTGATAAAG ATTTCTTAAT 350 

GCCGATTGAA GACGTATTCT CAATTTCTGG CCGTGGTACT GTTGTTACTG 400 

25 GTAGAATTGA AAAAGGTATA GTTAAAGTTG GTGATACTAT CGAAATCGTA 4 50 

GGTATTAGAG ATACACAAAC TACAACAGTT ACCGGCGTTG AAATGTTTAG 500 

AAAAGAAATG GATCAAGGCG AGGCTGGTGA TAACGTTGGT GTTCTTTTAC 550 

GCGGTACAAA GAAAGAAGAC GTTGAAAGAG GTATGGTTCT TTGTAAGCCA 600 

AAATCAATTA CTCCTCATAC TAAATTTGAG GGAGAAGTTT ATATCTTGAC 650 

30 TAAGGAAGAG GGCGGTAGAC ATACTCCATT CTTCAACAAC TATAGACCAC 700 

AATTTTATGT AAGAACAACA GATGTTACTG GATCAATCAC TCTTCCAGAG 750 

GGTACTGAGA TGGTTATGCC TGGTGATAAC TTAAAAATCA CTGTTGAGTT 800 

AATCAACCCA GTTGCTC 817 



2) INFORMATION FOR SEQ ID NO: 1837 



(i) SEQUENCE CHARACTERISTICS: 
i0 (A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1837 
CCATACTAGT TGTTTCTGCA GCTGATGGCC CAATGCCACA AACTAGAGAG 50 
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CACATTTTGC TATCTCGTCA 
GAACAAAGCT GATATGGTAG 
TGGAGATCAG AGAGTTATTA 
CCTATTATAA GCGGATCAGC 
TAATGATGGC GAATGGTCAG 
ATAGCTACAT ACCAACTCCA 
CCGATTGAAG ACGTATTCTC 
TAGAATTGAA AAAGGTATAG 
GTATTAGAGA TACACAAACT 
AAAGAAATGG ATCAAGGCGA 
CGGTACAAAG AAAGAAGACG 
AATCAATTAC TCCTCATACT 
AAGGAAGAGG GCGGTAGACA 
ATTTTATGTA AGAACAACAG 
GTACTGAGAT GGTTATGCCT 



AGTTGGTGTT CCATATATAG 
ATGACGCAGA GTTGCTAGAA 
AGCGAATATG ACTTCCCTGG 
ACTTCAAGCT CTTGAAGAAG 
CTAAGATTAT GGATCTTATG 
GTTCGTGCTA CTGATAAAGA 
GATTTCTGGC CGTGGTACTG 
TTAAAGTTGG TGATACTATC 
ACAACAGTTA CCGGCGTTGA 
GGCTGGTGAT AACGTTGGTG 
TTGAAAGAGG TATGGTTCTT 
AAATTTGAGG GAGAAGTTTA 
TACTCCATTC TTCAACAACT 
ATGTTACTGG ATCAATCACT 
GGTGATAACT TAAAAATCAC 



TTGTTTTTAT 100 

TTAGTTGAAA 150 

TGATGATACT 200 

CTAAAGCTGG 250 

GCTGCTGTTG 300 

TTTCTTAATG 350 

TTGTTACTGG 400 

GAAATCGTAG 4 50 

AATGTTTAGA 500 

TTCTTTTACG 550 

TGTAAGCCAA 600 

TATCTTGACT 650 

ATAGACCACA 700 

CTTCCAGAGG 750 

TGTTAGTT 7 98 



2) INFORMATION FOR SEQ ID NO: 1838 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1838 

35 GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACTGCA GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGCGGTTCTG CACGCGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 2 00 

CGTTGAATAT GACACCCCGA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

4 0 GGCACGCCGA CTACGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCGATGC CACAGACTCG 350 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATTC ATGATCGTGT 400 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGATTTCC CGGGCGACGA 500 

4 5 CATCCCAGTG GTTCGTGGTT CAGCTCTGAA AGCGCTGGAA GGCGAAGCAG 550 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG GTCACCTGGA TAACTACATC 600 

CCAGAACCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCC ATCTCCGGCC GTGGTACTGT TGTTACTGGT CGTGTAGAGC 700 

GTGGTATCAT TAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAT 750 

50 ACCGTGAAAT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATTAAAC 850 

GTGAAGATAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCTATCAAG 900 

CCGCACACTC AGTTCGAATC AGAAGTTTAT ATCCTGTCCA AAGATGAAGG 950 
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CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA 
GTACAACTGA CGTGACTGGC ACCATCGAAC 
GTAATGCCGG GCGACAACAT TCAAATGGTT 
CGCAATGGAC GACGGT 
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CCGTCCACAG TTCTACTTCC 1000 
TGCCAGAAGG CGTTGAGATG 1050 
GTTACCCTGA TCCACCCAAT 1100 

1116 



2) INFORMATION FOR SEQ ID NO: 1839 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE:* Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella OKytoca 
20 (B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1839 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

25 TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACCGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTGCTG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT AT C AC CATC A ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

30 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 4 50 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCTCTGGAA GGCGACGCTG 550 

35 AGTGGGAATC TAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 750 

ACTGCTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

4 0 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTTTAT ATCCTGTCCA AAGACGAAGG 950 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

45 GTTATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGA 1109 



50 2) INFORMATION FOR SEQ ID NO: 1840 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1108 bases 

960 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1840 



GTGTCTAAAG AAAAATTTGA ACGTACTAAA, CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCA GCTATCACTA 100 

15 CCGTACTGTC TAAAGTATAC GGTGGTCAGG CTCGTGCATT CGATCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT AT C AC CATC A ACACTTCTCA 200 

CGTAGAGTAC GACACCCCAA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

GTCACGCCGA CTACGTGAAG AACATGATCA CTGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTAGTAGC TGCGACTGAC GGCCCAATGC CTCAGACTCG 350 

20 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 4 00 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 4 50 

GAAATGGAAG TACGTGAGCT GCTGTCTCAG TACGACTTCC CAGGCGACGA 500 

TACTCCAGTT GTTCGCGGTT CTGCACTGAA AGCGCTGGAA GGCGATGCTC 550 

AGTGGGAAGA GAAGATTGTT GAACTGGCAG GCTACCTGGA CAGCTACATC 600 

25 CCTGAGCCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATCTCCGGCC GTGGTACTGT AGTAACTGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 750 

ACTACCAAGA CTACTTGTAC TGGCGTTGAA ATGTTCCGTA AGCTGCTGGA 800 

CGAAGGTCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 850 

30 GTGATGACGT AGAGCGTGGT CAGGTTCTGG CTAAGCCAGG CTCAATCAAC 900 

CCACACACCA ACTTTGTAGC AGAAGTTTAT ATTCTGTCCA AAGATGAAGG 950 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

GTAATGCCAG GTGACAACAT TCAAATGGTT GTTACCCTGA TTGCACCAAT 1100 

35 CGCGATGG 1108 



2) INFORMATION FOR SEQ ID NO: 1841 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1841 

961 
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TGGCAAAAGC TAAATTTGAA CGTATTAAGC CTCACGTAAA CGTGGGCACC 50 

ATTGGTCACG TTGACCATGG TAAAACCACT CTGACTGCAG CTATCTCTCA 100 

CGTACTGGCT AAGACCTACG GTGGCGAAGC TAAAGACTTC TCTCAAATCG 150 

5 ATAACGCTCC AGAAGAGCGT GAGCGCGGTA TTACCATCAA TACCTCTCAC 200 

ATCGAATATG ACACGCCATC ACGCCACTAC GCCCACGTAG ACTGCCCAGG 250 

CCACGCTGAC TATGTTAAAA ACATGATCAC TGGTGCTGCA CAGATGGACG 300 

GCGCGATTCT GGTAGTCGCT TCAACAGACG GTCCAATGCC ACAGACTCGT 350 

GAGCACATCC TGCTTTCTCG TCAGGTTGGC GTACCATTCA TCATCGTATT 400 

10 CATGAACAAA TGTGACATGG TAGATGACGA AGAGCTGTTA GAGCTAGTTG 450 

AGATGGAAGT GCGTGAACTG TTATCAGAAT ACGATTTCCC AGGTGATGAC 500 

TTACCGGTAA TCCAAGGTTC AGCTCTGAAA GCGCTAGAAG GCGAGCCAGA 550 

GTGGGAAGCA AAAATCCTTG AATTAGCAGC GGCGCTGGAT TCTTACATTC 600 

CAGAACCACA ACGTGACATC GATAAGCCGT TCCTACTGCC AATCGAAGAC 650 

15 GTATTCTCAA TTTCAGGCCG TGGTACAGTA GTAACAGGTC GTGTTGAGCG 700 

TGGTATTGTA CGCGTAGGCG ACGAAGTTGA AATCGTTGGT GTACGTGCGA 750 

CAACTAAGAC AACGTGTACT GGTGTAGAAA TGTTCCGTAA ACTGCTTGAC 800 

GAAGGTCGTG CAGGTGAGAA CTGTGGTATT TTGTTACGTG GTACTAAGCG 850 

TGATGACGTA GAACGTGGTC AAGTATTAGC GAAGCCAGGT TCAATCAACC 900 

20 CACACACTAC TTTTGAATCA GAAGTTTACG TACTGTCAAA AGAAGAAGGT 950 

GGTCGTCACA CGCCATTCTT CAAAGGCTAC CGTCCACAGT TCTACTTCCG 1000 

TACAACTGAC GTAACCGGTA CTATCGAACT GCCAGAAGGC GTAGAGATGG 1050 

TAATGCCAGG CGATAACATC AAGATGGTAG TGACACTGAT TTGCCCAATC 1100 

GCGATGG 1107 

25 



2) INFORMATION FOR SEQ ID NO: 1842 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 
40 (B) STRAIN: ATCC 12841 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1842 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

45 AATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CACGTGCATT CGACCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

50 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTAT 400 

TCCTGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAGCTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CAGGCAATGA 500 

962 
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TACTCCAATC ATCCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCGTA GAACTGGCTG AAACTCTGGA TTCTTACATC 600 

CCAGAACCAG AACGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTTACCGGT CGTGTAGAGC 700 

5 GCGGTATCGT TAAAGTTGGT GAAGAAGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTTAAAT CAACTTGTAC CGGCGTTGAA ATGTTCCGTA AACTGCTGGA 800 

CGAAGGTCGT GCAGGCGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 850 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCTATCAAA 900 

CCACACACCA AGTTCGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

10 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAAATGATC GTTACCCTGA TCCACCCAAT 1100 

CGCAATGGAC GATGGT 1116 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1843 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1129 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1843 



GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACAAC TCTGACTGCT GCTATCACCA 100 

35 GCGTTTTAGC TAAAACTTAT GGCGGTAACG CTCGTGCATT CGATCAAATC 150 

GATAATGCAC CAGAAGAAAA AGCACGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAGTAT GATACTCCTG CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GACACGCCGA CTATGTGAAA AACATGATCA CCGGTGCTGC TCAAATGGAC 300 

GGCGCGATCT TAGTTGTTGC GGCAACTGAT GGTCCTATGC CACAGACTCG 350 

4 0 TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAACTGTT AGAATTAGTT 450 

GAAATGGAAG TTCGTGAGCT TCTTTCTGCT TATGATTTCC CTGGTGACGA 500 

TACTCCAGTT GTTCGTGGTT CTGCGCTGAA AGCGTTAGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCATT GAATTAGCCG GATATCTGGA TAGCTACATC 600 

45 CCAGAGCCAG AGCGTGCGAT TGACCGTCCG TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATTTCAGGCC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 700 

GCGGAATCGT TAAAGTCGGT GAAGCCGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTACAAA CTACCTGTAC TGGCGTTGAA ATGTTCCGTA AGTTACTTGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 850 

50 GTGAAGAAAT CGAACGCGGT CAAGTACTGG CTAAGCCAGG TTCAATCAAC 900 

CCGCACACCA ACTTTGTATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

963 
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GTGATGCCTG GTGACAACAT TCAGATGACT GTAACTCTGA TTGCACCAAT J 100 
CGCGATGGAC GAAGGTTTAC GCTTCGCTA 1129 



2) INFORMATION FOR SEQ ID NO: 1844 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear, 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ 

ATATTCATAA TGCATTACAA GTTGTGAAAA 
AAGACTGTTA CATTGGAAAC TGCTGTAGAA 
TACGATTGCC ATGGAATCTA CAGATGGTTT 
TGGACTTAGG ACGCACAATT AGCGTTCCTG 
CGTGTATTCA ACGTTTTAGG AGATACAATC 
AGAAGACTTT ACAAGACATG AAATCCATAA 
AATTAAACAG TCAATATGAA ATTCTACAAA 
CTTTTAGCAC CTTATCTTAA AGGTGGTAAA 
CGGTGTAGGG AAAACCGTAT TAATTCAAGA 
AAGAACTTGG TGGTATTTCA GTATTTACAG 
GAAGGGAATG ACCTTTACCA TGAAATGCAA 
AACAGCGATG GTGTTTGGGC AAATGAACGA 
GTGTAGCACT AACAGGGTTA ACTATTGCGG 
AAACAAGACG TGCTTTTATT CATCGATAAC 
AGGTTCAGAA GTGTCAGCGT TACTTGGTCG 
ATCAACCAAC ATTAGCGACA GAAATGGGTC 
TCAACTAAAG 



adiacens 



ID NO: 1844 

CAACAAGTGA CGGAAGTGAA 50 

TTAGGGGATG GTGCAGTTCG 100 

GCAACGTGGC ATGAAAGTAG 150 

TGGGACCTGA AACATTAGGT 200 

GACTTGAAAG AACCATTCCC 250 

ACCAGCACCA AAATTTGAAG 300 

CAGGGATTAA AGTTATTGAC 350 

ATCGGTTTAT TCGGTGGTGC 400 

ATTAATTCAT AATATCGCTG 450 

GGGTAGGGGA ACGTACTCGT 500 

GAATCAGGCG TATCTGCTAA 550 

ACCACCAGGA GCTCGTATGC 600 

AATACTTCCG TGATATGGAA 650 

ATTTATCGTT TCACGCAAGC 700 

TATGCCTTCT GCCGTAGGGT 750 

AATTACAAGA ACGTATCAGT 800 

810 



2) INFORMATION FOR SEQ ID NO: 1845 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 
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(B) STRAIN : ATCC 9345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1845 

5 TGCTCTGCCA GAAATTAACA ACGCCCTCCT CACCGAAGTA GACCTCTCCG 50 

GCCAAGGCGA AGGCGAAAGC GTTCTCAAGA TGACTCTTGA GGTTGCTCAG 100 

CACCTCGGCG ATAACATCGT CCGTACCATC GCCATGAAGC CAACCGACGG 150 

TCTGGTTCGC GGCGCCACCG TTATCGATAC CGGCGCCCCA ATCACCGTGC 200 

CAGTTGGCGA CGCAACTAAA GGTCATGTTT TCAACGTGAC CGGTGATGTC 250 

10 CTAAACTTGG GCGAAGGCGA AACCCTTGAC GTCAAGGAAC GGTGGCCAAT 300 

CCACCGCAAG GCTCCACAGT TCGACGAACT CGAACCGGAA ACCAAGATGT 350 

TCGAAACAGG CATCAAGGTG ATCGATCTCC TCACCCCATA CGTACAGGGC 400 

GGCAAGATCG GTCTGTTTGG CGGTGCTGGT, GTTGGTAAGA CCGTTCTTAT 450 

CCAGGAAATG ATCCAGCGTG TTGCACAGGA TCATGGCGGT GTGTCCGTGT 500 

15 TCGCGGGTGT GGGTGAACGT ACCCGTGAAG GTAACGATCT TATCCACGAA 550 

ATGGAAGATG CGGGCGTTCT TGATAAGACC GCGCTTGTGT TCGGCCAGAT 600 

GGATGAACCG CCAGGGGTTC GTTTGCGTAT TGCACTTTCC GGCCTGACCA 650 

TGGCGGAATA CTTCCGTGAC GTGCAAAACC AGGACGTGCT TTTGTTCATC 700 

GATAACATCT TCCGCTTCAC CCAGGCAGGT TCGGAAGTGT CCACGTTGCT 750 

20 TGGCCGTATG CCATCAGCAG TGGGCTACCA GCCGACCTTG GCAGATKAAA 800 

TGGGCGCATT GCAGG 815 



25 2) INFORMATION FOR SEQ ID NO: 1846 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 



(xi) SEQUENCE DESCRIPTION: SEQ 

40 

ACCTTCCTCC 

TTAGTTTTAG 

TGCTATGGAC 

CCGGATATCC 
45 ATGAACGTTA 

GAAGCTTGCA 

CCACCCCCGA 

CCTTACGCTC 

TAAGACTGTG 
50 GTGGTTACTC 

GATTTGTACC 

ATCCAAGGCT 

GTGCCCGMGT 



ID NO: 1846 



TATCTTGAAH 
AGGTGTCCCA 
GGTACTGAAG 
TATTAGAGTT 
TTGGCGAGCC 
CCCATCCACG 
AATCTTGGAG 
GTGGTGGTAA 
TTTATCCAGG 
CGTGTTCGCT 
ACGAGATGAT 
GCCCTTGTCT 
CGCTTTGACC 



GCCCTGGAGG 
GCATTTGGGT 
GATTGGTTCG 
CCTGTCGGTC 
TGTTGATGAG 
CTTCTCCCCC 
ACTGGTATTA 
GATCGGTCTT 
AGTTGATYAA 
GGTGTTGGTG 
TCAAACTGGT 
ACGGMCAAAT 
GGTCTTACCG 



TCCAAAACCA 


CAGCTCTCGY 


50 


GAAAACACCG 


TTCGTACTAT 


100 


TGGTCAAAAT 


GTCGTAGATA 


150 


CTGAATGTTT 


GGGTCGTATC 


200 


CGCGGYCCTA 


TCAAGACCAA 


250 


CGAGTTCGTY 


GACCAATCCA 


300 


AGGTTGTCGA 


TTTGTTGGCC 


350 


TTCGGTGGTG 


CCGGTGTCGG 


400 


CAACGTTGCC 


AAGGCCCACG 


450 


AGCGTACTCG 


TGAGGGTAAC 


500 


GTCATCAAGC 


TTGATGGCCA 


550 


GAACGAGCCC 


CCAGGTGCTC 


600 


TTGCTGAATA 


CTTCCGTGAT 


650 
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GAGGAAGGHC AAGATGTGTT GCTCTTCATT 
CCAAGCTGGT TCTGAAGTGT CYGCCTTGTT 
TCGGTTACCA ACCCACCTTG GCCACCGATA 
ATTACCACCA CCAAGAAGGG TTCCATTACC 
5 CCCTGCTGAT GATTTGACCG ATCCCGCTCC 
TTGACGCCAC CACCGTGTTG TCTCGTTCCA 
CCCGCTGTCG ATCCCCTCGA CTCCAAGTCT 
■ TGTCGGTGAA GAGCACTACG ACATCGCCAC 
ARTCTTACAA GTCTCTCCAG GAT 



PCT/CAOO/01150 

GACAACATTT TCCGTTTCAC 700 

GGGTCGTATC CCCTCCGCTG 750 

TGGGTGTCAT GCAAGAGCGT . 800 

TCTGTCCAGG CCATTTACGT 850 

TGCCACTACT TTTGCCCATC 900 

TCTCTGAGTT GGGTATTTAC 950 

CGTATGTTGG ATCCYCGTAT 1000 

TGGTGTTCAG AAGATTCTCC 1050 

1073 



2) INFORMATION FOR SEQ ID NO: 1847 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 
25 (B) STRAIN: ATCC 56220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1847 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TGCCTCACCC TTTGGGTATT 50 

30 TTGCGAATAC TAATTATAGT AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAACGTA CTCGTGAGGG TAACGATTTG 150 

TACCACGAAA TGCAGGAAAC TGGTGTCATT CAGCTCGAGG GTGAATCCAA 200 

GGTCGCCCTC GTGTTCGGTC AGATGAACGA GCCCCCTGGT GCCCGTGCCC 250 

GTGTCGCTCT TACTGGTTTG ACCATTGCCG AGTACTTCCG TGACGAGGAG 300 

35 GGTCAAGATG TGCTTCTCTT CATTGACAAC ATTTTCCGTT TCACTCAGGC 350 

CGGTTCTGAG GTGTCTGCCC TTTTGGGTCG TATCCCCTCT GCCGTCGGTT 4 00 

ACCAGCCCAC TCTCGCCGTC GACATGGGTG TCATGCAGGA GCGTATTACC 4 50 

ACCACCACCA AGGGTTCCAT CACCTCCGTC 4 80 



2) INFORMATION FOR SEQ ID NO: 1848 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 566 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 
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(B) STRAIN: ATCC 14112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1848 

5 TGTCTTCATT CAGGAGTTGA TTGTACGTCC CTTCCTCTCT ACAAATGACG 50 

GGCGAGGAAA ATTTTTGGCT TTTTCTAATA GCTCGTTATA GAACAACATT 100 

GCCAAAGCCC ACGGTGGTTA CTCCGTTTTC ACTGGTGTCG GCGAGCGGAC 150 

CCGTGAAGGA AACGATTTGT ACCACGAGAT GCAGGAAACC CGTGTTATCC 200 

AGCTCGATGG CGAGTCTAAG GTCGCACTCG TCTTCGGTCA GATGAACGAG 250 

10 CCCCCCGGAG CCCGTGCCCG TGTTGCCCTC ACTGGCCTGA CCATTGCTGA 300 

ATATTTCCGT GACGAGGAAG GTCAAGACGG TATGTATTCA TATAAATTAC 350 

TCCGGGCAAA TTGACTCAGA ACCGCACTCA CTCACACATA TATTAGTGCT 4 00 

TCTCTTTATC GACAACATTT TCCGCTTCAC CCAGGCCGGT TCCGAAGTGT 4 50 

CCGCCCTGCT TGGTCGTATT CCCTCCGCCG TCGGTTACCA ACCCACTCTC 500 

15 GCCGTCGACA TGGGTGGTAT GCAGGAACGT ATCACAACCA CCACCAAGGG 550 

CTCCATTACC TYCGTG 566 



20 2) INFORMATION FOR SEQ ID NO: 1849 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 



35- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1849 



AATGAAGCCA TTGTTGTAAA TTTTGAAAGT GAAGGCAAAA AACAAAAACT 50 

TGTTTTAGAA GTAGCAGCAC ACTTGGGCGA TAATAGAGTT AGAACTATTG 100 

CTATGGATAT GACAGATGGC TTGGTAAGAG GACTTAAAGC AGAAGCTTTG 150 

GGTGCTCCTA TTAGCGTTCC TGTGGGTGAA AAAGTTTTAG GAAGAATTTT 200 

40 TAATGTTACG GGAGATTTGA TCGATGAAGG TGAAGAAATT TCTTTTGATA 250 

AAAAATGGGC AATTCATAGA GATCCACCAG CTTTTGAAGA TCAAAGCACA 300 

AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTGGATT TACTTGCTCC 350 

TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA GGTGTTGGTA 400 

AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT TAAACATAGC 450 

45 GGCTATTCTG TATTTGCAGG TGTAGGTGAG AGAACTCGTG AAGGAAATGA 500 

CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA GTTGCTCTAT 550 

GTTATGGACA AATGAATGAA CCACCAGGGG CAAGAAATCG TATTGCTTTA 600 

ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG GTCTTGATGT 650 

GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA GGTTCTGAAA 700 

50 TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA TCAACCAACC 750 

CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT CAACTAAAAA 800 

AGGATCAATT ACTTCAG 817 



967 
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2) INFORMATION FOR SEQ ID NO: 1850 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 775 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1850 

AAGGCAACAC GCATAAACTT ATTTTAGAGA CTGCTGCACA CCTTGGAGAT 50 

AATCGTGTAA GAACTATCGC TATGGATATG AGCGAAGGAC TTACAAGAGG 100 

GTTAGATGCT ATAGCGCTTG GGTCGCCTAT CAGTGTTCCT GTTGGAGAAA 150 

AAGTTTTAGG AAGAATATTC AACGTAATTG GTGATCTTAT AGACGAAGGC 200 

GAAGAAGAAA AATTTGATAA AAAATGGTCG ATTCATAGAG ATCCGCCGGC 250 

ATTTGAAGAT CAAAGCACAA AAAGTGAAAT TTTTGAAACA GGTATAAAAG 300 

TCGTAGATCT TTTGGCTCCT TATGCAAAAG GCGGTAAAGT TGGACTATTT 350 

GGCGGTGCCG GCGTTGGTAA AACAGTTATC ATTATGGAAC TTATCCACAA 400 

CGTTGCATTC AAACACAGCG GCTATTCGGT ATTTGCCGGT GTCGGTGAAA 4 50 

GAACAAGAGA GGGTAACGAT CTTTATAATG AAATGAAAGA ATCCGGCGTT 500 

TTGGATAAAG TTGCCTTATG TTATGGACAA ATGAATGAAC CGCCGGGTGC 550 

AAGAAACCGT ATAGCGCTTA CTGGTCTTAC AATGGCTGAG TATTTTCGTG 600 

ACGAGATGGG ACTAGATGTT CTTATGTTTA TCGATAACAT CTTCCGTTTC 650 

TCACAATCAG GCTCAGAGAT GTCGGCTCTT CTTGGACGTA TCCCAAGTGC 700 

GGTTGGTTAT CAACCAACGT TAGCTAGCGA AATGGGAAGA CTTCAAGAAA 750 

GAATCACATC AACTAAAAAA GGTTC 775 



2) INFORMATION FOR SEQ ID NO: 1851 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp- venerealis 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1851 

CGAAGCTATT GAAGTAAATT TTACAGTAGA AGGCAACACG CATAAACTTA 50 
TTTTAGAGAC TGCTGCACAC CTTGGAGATA ATCGTGTAAG AACTATCGCT 100 
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ATGGATATGA GCGAAGGACT 
GTCGCCTATC AGTGTTCCTG 
ACGTAATTGG TGATCTTATA 
AAATGGTCGA TTCATAGAGA 
5 AAGTGAAATT TTTGAAACAG 
ATGCAAAAGG CGGTAAAGTT 
ACAGTTATCA TTATGGAACT 
CTATTCGGTA TTTGCCGGTG 
TTTATAATGA AATGAAAGAA 
10 TATGGACAAA TGAATGAACC 
TGGTCTTACA ATGGCTGAGT 
TTATGTTTAT CGATAACATC 
TCGGCTCTTC TTGGACGTAT 
AGCTAGCGAA ATGGGAAGAC 

15 



TACAAGAGGG TTAGATGCTA 
TTGGAGAAAA AGTTTTAGGA 
GACGAAGGCG AAGAAGAAAA 
TCCGCCGGCA TTTGAAGATC 
GTATAAAAGT CGTAGATCTT 
GGACTATTTG GCGGTGCCGG 
TATCCACAAC GTTGCATTCA 
TCGGTGAAAG AACAAGAGAG 
TCCGGCGTTT TGGATAAAGT 
GCCGGGTGCA AGAAACCGTA 
ATTTTCGTGA CGAGATGGGA 
TTCCGTTTCT CACAATCAGG 
CCCAAGTGCG, GTTGGTTATC 
TTCAAGAAAG AATCACATCA 



TAGCGCTTGG 150 

AGAATATTCA 200 

ATTTGATAAA 250 

AAAGCACAAA 300 

TTGGCTCCTT 350 

CGTTGGTAAA 400 

AACACAGCGG 4 50 

GGTAACGATC 500 

TGCCTTATGT 550 

TAGCGCTTAC 600 

CTAGATGTTC 650 

CTCAGAGATG 700 

AACCAACGTT 750 

ACT 7 93 



2) INFORMATION FOR SEQ ID NO: 1852 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double. 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 

30 (B) STRAIN: ATCC 33236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1852 

GGACTATTTA CCGAAGATTA ACGAAGCTAT CGAGGTTAAA TTTGACGTCG 50 

35 AGGGCGCTCA TCGCAGGCTG ATCCTAGAGG TAGCCGCGCA CCTTGGAGAC 100 

AATCGCGTCC GCACGATCGC TATGGATATG AGCGATGGAC TTAGGCGAGG 150 

GCTTGAGGCC GTCGCTTTGG GCGCGCCTAT TACGGTGCCT GTGGGCGAGA 200 

AAGTTTTGGG TAGAATTTTT AATGTTACGG GCGATCTGAT CGACGAAGGC 250 

GAGGATGAAA AATTTGAAAC CCGCTGGTCG ATCCACAGAG ATCCGCCTAG 300 

4 0 CTTTGAAAAT CAAAGCACGA AGAGTGAAAT TTTTGAAACC GGCATTAAGG 350 

TAGTCGATCT GCTCGCCCCT TATGCAAAGG GCGGTAAGGT AGGACTATTC 400 

GGCGGTGCTG GCGTCGGTAA GACCGTCATC ATCATGGAAC TGATTCACAA 450 

CGTCGCTTTC AAACACAGCG GCTACTCCGT ATTTGCGGGT GTCGGCGAGC 500 

GAACGAGAGA GGGAAACGAC CTTTATAACG AGATGAAAGA ATCGGGCGTT 550 

45 TTGGATAAAG TCGCCTTGAC CTATGGTCAG ATGAACGAAC CGCCGGGAGC 600 

GAGAAACCGT ATCGCGCTAA CCGGTCTTAC GATGGCCGAG TATTTCCGCG 650 

ACGAGCTAGG GCTTGACGTT TTGATGTTTA TTGATAATAT CTTCCGCTTC 700 

TCGCAGTCGG GTTCGGAGAT GTCCGCGCTT TTAGGACGAA TTCCGTCCGC 750 

GGTCGGTTAT CAGCCTACGC TTGCCAGCGA AATGGGTAAA TTACAGGAGC 800 

50 GCATTACTTC TACTAAGAAG GGCTC 825 
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2) INFORMATION FOR SEQ ID NO: 1853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1853 

TTTACCTCAA ATTAATGAAG CAATTGTTGT AAATTTTGAA AGCGAAGGAA 50 

AAAAACATAA ACTTGTTTTA GAAGTAGCAG CTCATTTAGG AGATAATAGA 100 

GTTAGAACTA TTGCTATGGA TATGACAGAT GGTTTGGTAA GGGGCTTAAA 150 

AGCTGAGGCT TTAGGTGCTC CTATTAGTGT TCCTGTTGGT GAGAAAGTTT 200 

TAGGAAGAAT TTTCAATGTT ACTGGAGATT TGATCGATGA AGGTGAAGAA 250 

ATTTCTTTTG ATAAAAAATG GGCAATTCAT AGAGATCCGC CAGCTTTTGA 300 

AGATCAAAGC ACAAAAAGTG AGATTTTTGA AACAGGGATT AAAGTTGTAG 350 

ATTTGCTTGC TCCTTATGCA AAAGGTGGTA AAGTAGGTCT TTTTGGTGGT 400 

GCAGGTGTTG GTAAAACTGT TATTATTATG GAGCTTATTC ACAATGTTGC 450 

ATTTAAGCAT AGCGGCTATT CTGTATTTGC AGGTGTGGGT GAGAGAACTC 500 

GTGAAGGAAA TGACCTTTAT AATGAAATGA AAGAAAGTAA TGTTTTAGAC 550 

AAAGTTGCTC TATGTTATGG ACAAATGAAT GAACCACCAG GAGCAAGAAA 600 

TCGTATTGCT TTAACAGGTT TAACAATGGC TGAGTATTTT AGAGATGAAA 650 

TGGGTCTTGA TGTGCTTATG TTTATTGATA ATATCTTTAG ATTTTCACAA 700 

TCAGGTTCTG AAATGTCAGC ACTTTTAGGA AGAATTCCAT CAGCTGTGGG 750 

TTATCAACCA ACCCTAGCAA GTGAAATGGG TAAATTCCAA GAAAGAATTA 800 

CTTCAACTAA AAAAGGCT 818 



2) INFORMATION FOR SEQ ID NO: 1854 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1854 

ATTACCTGAT ATCAACAACG CCTTATTGGT CTATAAAAAT GATGAACAAA 50 
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AAAGTAAAAT TGTGCTAGAA GCTGCCTTAG 

CGTACAATTG CCATGGAATC AACGGATGGT 

TGTCGATACT GGTAAACCAA TTTCAGTTCC 

GACGTGTCTT TAACGTTTTA GGGGATACGA 

5 GCACAAGATG CAGATCGTTC TGCAATTCAT 

AGACTTAAGT ACAAGTACTG AAATTTTAGA 

ACTTATTAGC ACCATATTTA AAAGGTGGTA 

GCCGGAGTAG GTAAAACCGT TTTAATCCAA 

ACAAGAACAT GGTGGGATTT CTGTATTTAC 

10 GTGAAGGAAA TGACTTGTAT CATGAAATGC 

AAAACTGCCA TGGTGTTTGG TCAAATGAAC 

GCGTGTTGCT TTAACTGGGT TAACGATTGC 

AAGGACAAGA TGTGTTGCTA TTTATTGATA 

GCGGGTTCTG AAGTATCAGC CTTGCTTGGT 

15 TTATCAACCT ACATTGGCTA CAGAAATGGG 

CTTCAACTAA GAAGGGCTCT ATCACTTCTA 
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AATTAGGTGA TGGCATCATT 100 

TTACAACGTG GGATGGAAGT 150 

AGTTGGTAAA GAAACGCTAG 200 

TTGATATGCA AGAACCATTT 250 

AAAGCTGCAC CAAAATTTGA 300 

AACAGGGATT AAAGTTATCG 350 

AAGTCGGTCT ATTCGGGGGT 400 

GAATTAATCC ATAATATTGC 450 

CGGTGTTGGT GAACGTACAC 500 

GTGATTCAGG AGTTATTGAA 550 

GAACCACCTG GAGCTCGTAT 600 

TGAATATTTC CGTGATGTAG 650 

ACATCTTCCG TTTCACTCAA 700 

CGTATGCCAT CTGCCGTGGG 750 

TCAATTACAA GAACGTATCA 800 

830 



20 2) INFORMATION FOR SEQ ID NO: 1855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1855 



TCTTTACCAG ATATCAATAA TGCGCTTATT GTCTATAAAA ATGATGAACA 50 

AAAAAGTAAA ATCGTGCTTG AAGCTGCTTT AGAGCTAGGA GATGGCATTA 100 

TTCGTACGAT TGCAATGGAA TCAACTGATG GATTGCAACG TGGAATGGAA 150 

GTTTTCGATA CAGGTAAGCC AATTTCAGTA CCAGTAGGTC GTGAAACATT 200 

40 AGGTCGTGTA TTTAATGTTT TAGGTGATAC CATTGATACG CAAGAAGCTT 250 

TTCCTGCTGA TGCGAATCGT GATGCGATTC ATAAATCAGC TCCAGCTTTT 300 

GAAGAATTAA GTACAAGTAC TGAAATCCTA GAAACAGGGA TTAAAGTTAT 350 

CGACTTACTA GCACCATACT TAAAAGGTGG GAAAGTTGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACC GTATTAATTC AAGAATTAAT TCATAATATC 450 

45 GCCCAAGAAC ATGGGGGTAT TTCAGTATTT ACCGGTGTTG GTGAACGTAC 500 

ACGTGAAGGA AATGACTTGT ATCACGAAAT GCGTGATTCA GGCGTTATCG 550 

AAAAAACTGC TATGGTGTTT GGGCAAATGA ACGAACCACC TGGAGCACGT 600 

ATGCGTGTTG CGCTAACTGG ACTAACTATT GCGGAATACT TCCGTGATGT 650 

TGAAGGCCAA GACGTATTGC TATTTATTGA TAATATCTTC CGTTTTACTC 700 

50 AAGCAGGTTC TGAAGTTTCT GCCTTACTTG GTCGTATGCC TTCTGCGGTA 750 

GGTTATCAAC CTACTTTGGC TACTGAAATG GGTCAATTGC AAGAACGGAT 800 

TACATCAACG AAGAAAGGTT CGA 823 
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2) INFORMATION FOR SEQ ID NO: 1856 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1856 

TTACCAGACA TTAATAATGC CTTGGTTGTC TATAAAAATG ACGAACAAAA 50 

AACCAAGATT GTATTAGAAG CTGCCTTAGA ACTAGGAGAT GGTGTGATTC 100 

GAACTATCGC CATGGAATCT ACTGATGGCT TACAACGGGG AATGGAAGTT 150 

GTCGATACTG GCAGTTCCAT TTCTGTACCG GTAGGAAAAG AAACATTGGG 200 

TCGTGTATTT AACGTTTTAG GAAATACAAT TGACTTAGAA GAACCTTTTC 250 

CAGCGGATGC TAAACGTAGT GGTATCCATA AAAAAGCGCC TGATTTTGAT 300 

GAATTAAGCA CTAGTACAGA AATTTTAGAA ACAGGGATTA AAGTTATTGA 350 

CCTATTAGCC CCTTATTTAA AAGGTGGTAA AGTCGGATTA TTCGGTGGTG 400 

CCGGAGTTGG TAAAACCGTT TTAATTCAAG AATTAATTCA TAATATTGCC 450 

CAAGAACATG GTGGGATTTC TGTTTTTACT GGTGTTGGTG AAAGAACACG 500 

TGAAGGTAAT GACTTGTATT ATGAAATGAA AGAATCTGGC GTTATCGAAA 550 

AAACTGCCAT GGTATTTGGT CAAATGAATG AGCCACCTGG TGCCCGGATG 600 

CGGGTTGCTT TAACCGGACT TACCATTGCG GAATACTTCC GGGACGTTGA 650 

AGGACAAGAT GTATTGCTCT TTATCGATAA TATTTTCCGT TTTACCCAAG 700 

CTGGTTCAGA AGTATCTGCC TTATTAGGAC GGATGCCCTC TGCCGTTGGT 750 

TATCAACCAA CTTTGGCTAC TGAAATGGGA CAACTTCAAG AACGGATTAC 800 

CTCAACGAAA AAAGGTTCTA TTACAT 826 



2) INFORMATION FOR SEQ ID NO: 1857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 4 3197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1857 
TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTACAAAA AAAATAAAAC 50 
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AAAAGTTGTT CTTGAAGCTG CTTTGGAACT 
CGATCTCTAT GGAATCAACA GATGGCTTGC 
GATACAGGCA AACCAATCTC AGTTCCCGTT 
TGTGTTTAAC GTATTAGGTG AAACAATCGA 
5 AAGATGCAGT AAAAAGCGGT ATTCATAAAA 
CTTAGTACCA GTAATGAAAT TTTAGAAACA 
ATTAGCTCCT TACTTAAAGG GTGGTAAAGT 
GTGTTGGTAA AACCGTCTTG ATCCAAGAAT 
GAACACGGTG GTATTTCAGT GTTTACGGGT 

10 AGGGAACGAC CTTTATTATG AAATGAAGGA 
CTGCCATGGT GTTTGGACAA ATGAACGAGC 
GTTGCCTTGA CTGGTTTGAC ATTGGCTGAA 
ACAAGATGTG CTGTTGTTTA TCGACAACAT 
GTTCTGAAGT TTCTGCCTTG CTTGGCCGGA 

15 CAACCAACTT TGGCAACTGA AATGGGTCAA 
AACGAAGAAG GGCT 
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TGGTGATGGT GTTATCCGCA 100 

AACGTGGAAT GGAAGTTGTC 150 

GGTAAAGAAA CTTTAGGTCG 200 

CAAAGAAGCG CCTTTTCCAG 250 

AAGCGCCGGC TTTTGAAGAA 300 

GGGATCAAAG TTATCGACTT 350 

CGGACTATTT GGTGGTGCCG 400 

TGATTCATAA TATCGCCCAA 450 

GTTGGTGAAC GTACTCGTGA 500 

ATCAGGCGTT ATTGAGAAAA 550 

CGCCAGGTGC GCGTATGCGT 600 

TATTTCCGAG ATGAAGAAGG 650 

CTTCCGTTTC ACTCAAGCCG 700 

TGCCTTCAGC CGTTGGCTAC 750 

TTGCAAGAAC GAATCACTTC 800 

814 



20 2) INFORMATION FOR SEQ ID NO: 1858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mvndtii 

(B) STRAIN: ATCC 43186 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1858 

CGCATTAGTT GTTTATAAAA ATGATGAGCA 
AAGCAGCATT AGAATTAGGT GACGGTGTGA 
TCGACGGATG GACTACAACG TGGAATGGAA 
GATCTCTGTA CCAGTTGGAA CAGAAACATT 
TAGGTGAAAC AATCGATTTG GAAGCACCAT 
AGCGAGATCC ACAAGAAAGC ACCAAATTTT 
AGAGATTCTT GAAACTGGGA TCAAAGTCAT 



GTACTGATCC AAGAATTGAT CCATAATATC 
45 CTCAGTGTTT ACCGGTGTAG GGGAACGTAC 
ATTACGAAAT GAAAGATTCA GGCGTAATCG 
GGACAAATGA ATGAGCCACC AGGTGCTCGT 
ATTGACGATT GCGGAATATT TCCGTGATGT 
TATTTATTGA TAATATTTTC CGTTTCACCC 
50 GCCTTACTAG GACGTATGCC ATCAGCGGTT 
GACTGAAATG GGACAACTCC AAGAACGGAT 



AAAATCAAAA 


GTTGTTCTTG 


50 


TCCGTACGAT 


CGCAATGGAA 


100 


GTCATCGACA 


CAAGCAAAGC 


150 


AGGTCGTGTG 


TTCAACGTGT 


200 


TTCCAGAGGA 


TGCCCAAAGA 


250 


GATGAATTAA 


GCACAAGTAC 


300 


TGACTTATTA 


GCACCTTATT 


350 


GTGCCGGTGT 


TGGTAAAACC 


400 


GCCCAAGAAC 


ATGGGGGAAT 


450 


CCGTGAAGGA 


AACGATCTGT 


500 


AAAAAACAGC 


GATGGTGTTT 


550 


ATGCGTGTCG 


CACTAACTGG 


600 


CGAAGGACAA 


GACGTGCTCT 


650 


AAGCAGGTTC 


AGAAGTATCT 


700 


GGTTATCAAC 


CAACCTTAGC 


750 


CACTTCAACG 


A 


791 
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2) INFORMATION FOR SEQ ID NO: 1859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1859 

TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTATAAAA AAGATAAAAC 50 

AAAAGTTGTT CTTGAAGCTG CTTTGGAACT TGGTGATGGT GTTATTCGCA 100 

CAATCGCCAT- GGAATCAACG GATGGATTAC AACGTGGAAT GGAAGTTGTC 150 

GATACTGGCA AGCCTATTTC TGTTCCAGTA GGAAAAGAAA CTCTAGGTCG 200 

TGTATTTAAT GTATTAGGTG AAACAATCGA CAAGGAAGCG CCTTTTCCAG 250 

AAGATGCAGA AAAAAGTGGT ATTCACAAGA AAGCACCAAC TTTCGAAGAA 300 

CTTAGCACAA GTAATGAGAT CTTAGAAACA GGAATCAAAG TTATTGACTT 350 

GTTAGCTCCT TACTTAAAAG GTGGTAAAGT TGGATTATTT GGTGGTGCCG 400 

GTGTTGGTAA AACAGTCTTG ATTCAAGAGC TAATTCATAA TATCGCTCAA 450 

GAACATGGTG GTATTTCTGT GTTTACTGGT GTTGGTGAAC GTACTCGTGA 500 

AGGGAACGAC CTTTATTATG AAATGAAAGA TTCTGGTGTT ATTGAGAAAA 550 

CTGCTATGGT GTTCGGTCAA ATGAACGAGC CGCCAGGTGC ACGTATGCGT 600 

GTTGCCTTAA CTGGTTTAAC CTTAGCCGAA TACTTCCGTG ATGAAGAAGG 650 

ACAAGATGTG TTGCTATTTA TTGACAACAT TTTCCGTTTC ACTCAAGCCG 700 

GATCAGAAGT TTCTGCCTTA CTTGGCCGTA TGCCGTCAGC AGTTGGTTAC 750 

CAACCGACTT TAGCAACTGA AATGGGTCAA TTACAAGAAC GTATTACGTC 800 

GACGAAAAAA GGTTCAA 817 



2) INFORMATION FOR SEQ ID NO: 1860 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 

(B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1860 

CCTGACATTC ATAATGCATT AATTGTAACG AACGCTGATA TGGCGGATGT 50 
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AATGCAAGAA AATATTTCGG ATGAAGAAAA ATTATTAACC TTAGAAGTTG 100 

CACTGGATTT AGGTCATGGA ATGGTCCGGA CAATTGCGAT GGAATCAACC 150 

GATGGTTTGG AACGCGGCAT GACAGTTGTG GATTATTTAA CACCGATTAA 200 

AGTGCCAGTA GGCGAAGCCA CTTTAGGTAG AGTATTCAAT GTTTTAGGTG 250 

5 AGACAATTGA TGAACTAGAA CCGGTTGGCG ACGATGTTGA ACTCAAAAGT 300 

ATTCATCGTG AAGCCCCTAA ATATGAGGAC TTAGATAATA GTTTTCATGT 350 

TTTAGAAACC GGAATTAAGG TCATCGATTT ATTAGCTCCT TATATTAAAG 400 

GGGGAAAAAT CGGTTTATTC GGTGGTGCCG GAGTGGGTAA AACGGTCTTA 4 50 

ATTCAAGAAT TAATTCATAA TATTGCAGAA CAATTAGGAG GTATCTCAGT 500 

10 TTTCACTGGG GTTGGAGAAC GTACCCGTGA AGGGAATGAC CTCGTTTTTG 550 

AAATGCGAGA GTCAGGTGTA AGCAAGAAGA CGGCCATGGT TTTCGGTCAA 600 

ATGAATGAAC CACCTGGAGC ACGTATGCGT GTTGTCTTAA CAGGACTTAC 650 

AATGGCGGAA TATTTCCGTG ACGAATTGAA ACAAGACGTC TTATTATTTA 700 

TTGATAATAT TTATCGTTTT ACTCAAGCAG GTTCCGAAGT GTCAGCCTTA 750 

15 TTAGGTCGTA TGCCTTCAGC AGTAGGGTAT CAACCAACTT TAGCAAGTGA 800 

AATGGGACAA ATGCAAGAAG GTATTACGTC WACGAAGCRC GGTTCCATTA 850 

CA 852 



20 



2) INFORMATION FOR SEQ ID NO: 1861 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 828 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30- 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1861 

GCGCGACTCT TCCTGAGATT AATAACGCAC TCATCGTTTA CAAAGATGTA 50 

GACGGCGTTA AAACTAAAAT CGTCCTTGAA GTGGCGTTGG AACTTGGTGA 100 

TGGTGCCGTA CGTACCATCG CTATGGAATC AACTGATGGC TTGACACGTG 150 

40 GACTTGAAGT TCTCGATACA GGTAAAGCAA TCAGCGTACC TGTTGGTCAA 200 

GAAACACTTG GACGTGTCTT CAATGTACTT GGAGATGCTA TTGATGGAGG 250 

• GGAAGCATTT GCTGAAAATG CAGAACGCAG CCCTATCCAT AAAAAAGCCC 300 

CATCTTTTGA TGAACTTTCA ACAGCAAATG AAATTCTGGT GACAGGGATT 350 

AAAGTTATTG ACTTGCTTGC CCCATACCTT AAAGGTGGTA AGATTGGGTT 400 

45 GTTCGGTGGT GCCGGAGTTG GTAAAACCGT CCTTATCCAA GAGTTGATTC 450 

ACAATATTGC CCAAGAACAC GGTGGTATTT CCGTATTTAC TGGTGTTGGG 500 

GAACGTACAC GTGAAGGGAA TGACCTTTAC TGGGAAATGA AAGAATCAGG 550 

CGTTATCGAA AAAACAGCCA TGGTCTTCGG TCAAATGAAT GAACCACCTG 600 

GAGCACGTAT GCGTGTTGCT CTTACTGGTT TGACAATTGC TGAATATTTC 650 

50. CGTGATGTAG AAAAACAAGA CGTTTTGCTT TTCATTGATA ATATCTTCCG 700 

TTTCACCCAA GCCGGTTCAG AAGTATCTGC CCTCTTAGGA CGTATGCCAT 750 

CAGCCGTTGG TTACCAACCT ACGCTTGCAA CTGAAATGGG TCAACTTCAA 800 

GAACGTATCA CTTCAACAAA ACAAGGTT 828 

975 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(B) STRAIN: ATCC 11454 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1862 

AATTGCCTGA RATTAACAAT GCCTTGATTG TCTACAAAGA TGTCAATGGC 50 

CTAAAAACAA AAATTACTCT TGAAGTTGCT TTGGAACTTG GTGATGGTGC 100 

AGTTCGTACA ATCGCTATGG AATCTACTGA TGGCTTGACT CGTGGACTTG 150 

20 AAGTCCTTGA TACAGGTAAA GCAGTCAGCG TTCCTGTTGG GGAAGCCACT 200 

CTTGGTCGTG TTTTTAACGT TCTTGGTGAT GTTATTGACG GTGGGGAAGA 250 

ATTTGCTGCT GATGCAGAAC GTAATCCTAT CCATAAAAAA GCTCCAACAT 300 

TTGACGAATT GTCAACTGCA AACGAAGTTC TCGTAACTGG GATTAAAGTT 350 

GTCGATTTGC TTGCACCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

25 TGGTGCCGGA GTTGGTAAAA CCGTCCTTAT TCAAGAATTG ATTCACAACA 4 50 

TCGCCCAAGA ACACGGAGGT ATTTCTGTGT TTACCGGTGT TGGGGAACGT 500 

ACTCGTGAAG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

TGAAAAAACT GCCATGGTCT TTGGTCAAAT GAATGAACCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCTGAATA TTTCCGTGAT 650 

30 GTTCAAGGTC AAGACGTACT GCTTTTCATT GACAACATCT TCCGTTTCAC 700 

ACAAGCTGGT TCAGAAGTTT CTGCCCTTTT GGGACGTATG CCTTCTGCCG 750 

TTGGTTACCA ACCAACACTT GCTACTGAAA TGGGGCAATT GCAAGAACGT 800 

ATCACTTCTA CTAAGAAGGG TTCTGTTA 828 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1863 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1863 
CTTACCTGAA ATCTACAACG CCCTAGTTAT TGAATATAAA TCTGATGCAG 50 
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AAGAAGCACC AACTAGCCAA CTTACTTTAG 

GATGACGTTG TTCGTACAAT TGCAATGGCA 

AGGTATGGAA GTTATTGATA CTGGGAGCCC 

CAGTAACACT TGGTCGTGTA TTTAACGTAT 

5 GATGAGCCAC TTCCAAGCGA TATCAAGCGT 

ACCAACATTT GACCAATTAG CAACAACTAC 

TAAAAGTTGT AGACTTGCTA GCCCCATATT 

TTGTTCGGCG GAGCGGGTGT TGGTAAAACC 

TCATAATATC GCTCAAGAAC ATGGTGGTAT 

10 GAGAACGTAC TCGTGAAGGG AACGATCTTT 

GGTGTAATTG AAAAAACTGC CATGGTATTC 

AGGTGCTCGT ATGCGTGTAG CTTTAACAGG 

TCCGTGATGA AGAACACCAA GATGTACTTC 

CGCTTTACTC AAGCTGGTTC AGAGGTTTCG 

15 ATCTGCAGTA GGTTATCAAC CAACTCTAGC 

AAGAACGTAT TACTTCTACT AATGT 



PCT/CAOO/01150 

AAGTAGCCAT TCAATTAGGT 100 

TCAACTGATG GTGTTCAAAG 150 

AATCACAGTT CCAGTTGGTA 200 

TAGGAAACAC TATCGATTTG 250 

AATAAAATTC ACCGTGAAGC 300 

TGAAATTCTT GAAACAGGAA 350 

TAAAAGGTGG TAAAATTGGT 400 

GTTTTAATCC AAGAACTTAT 4 50 

TTCTGTGTTC GCTGGTGTTG 500 

ACTTTGAAAT GAAAGACTCT 550 

GGTCAAATGA ACGAACCACC 600 

TCTTACAATC GCTGAATATT 650 

TATTCATTGA TAATATTTTC 700 

GCTTTACTAG GTCGTATGCC 750 

TACCGAAATG GGACAATTAC 800 

825 



20 2) INFORMATION FOR SEQ ID NO: 1864 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1864 



GTCATATTCC AGAGCTTTAT GACGCTCTGG AGGTTAAGGG CGATGGTAAG 50 

CATCGTTCAG ACCTAGTTCT TGAGGTTCAA CAGCAGATTG GCGGTGGTGT 100 

GGTACGCTGC ATTGCCATGG GTTCTTCTGA CGGTTTGAGC AGAGGAATTG 150 

AGGCTGTAAA TACTGGTGCC GGTGTTAAGG TTCCAGTTGG TCGTGAGACC 200 

40 CTAGGACGTA TTATGAACGT TTTAGGTCAG CCTGTAGATG AGAGAGGTCC 250 

TATCGGACAG AAAGAGGATT GGGAAATTCA CCGTCCAGCT CCTACCTATG 300 

CTGAGCAGTC ATCAACTACA GAAATTCTAG AAACCGGTAT TAAGGTTATG 350 

GACCTTATCT GCCCATTTGC TAAGGGTGGT AAAGTTGGTC TGTTCGGTGG 400 

TGCCGGTGTG GGTAAGACAG TTAACATGAT GGAGCTTATC AATAACATTG 4 50 

45 CTAAGGCTCA CTCAGGTCTA TCTGTATTTA CCGGTGTTGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG CAGGAATCAA AGGTTATCGA 550 

TAAGGTATCA ATGATTTACG GTCAGATGAA CGAGCCTCCA GGGAACCGTC 600 

TACGTGTTGC TCTGACAGGT CTGACTGTTG CTGAGAAGTT CCGTGACGAA 650 

GGTCTGGATG TGCTTCTGTT CATCGATAAC ATCTATCGTT ATACACTGGC 700 

50 TGGTACAGAG GTATCTGCTC TGTTAGGCCG TATGCCTTCA GCTGTGGGTT 7 50 

ACCAGCCTAC ACTGGCTGAG GAAATGGGTG TATTACAGGA GCGTATTGCT 800 

TCAACTAAGA AAGGTTCTAT T 821 
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2) INFORMATION FOR SEQ ID NO: 1865 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1865 

TGATACTTTA CCAGATATCA ATAATGCATT AGCCGTATAT AAAAATGATG 50 

AGAACAAGAC GCGTGTTGTA TTGGAAGCTA CTTTAGAACT TGGAGATGGG 100 

GTAATTCGTG CCATTTCTAT GGGGTCTACT GACGGCTTGC AACGTGGCAT 150 

GGAAGTTGTG GATACACAAG AACCTATTTC TGTTCCGGTA GGAAATGATA 200 

CTTTAGGTCG TGTATTTAAT GTGTTAGGAG AAACAATAGA TAATCAGGAG 250 

CCATTTCCTG AAGATGCTGA AAAAAGTGGT ATTCACAAAA AAGCCCCTAG 300 

TTTTGATGAA TTAAGTACTA GTTCGGAAAT ATTAGAAACA GGGATCAAAG 350 

TGATTGATTT ATTAGAACCT TATCTAAGAG GCGGTAAAGT CGGATTGTTT 4 00 

GGAGGCGCCG GTGTTGGAAA AACGGTGCTA ATTCAAGAAT TGATCAATAA 450 

TGTTGCCCAA GAACACGGGG GTATTTCCGT GTTTAATGGT GTAGGTGAAC 500 

GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGCAGGA TTCAGGCGTT 550 

ATCGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC CACCAGGTGC 600 

TCGTATGCGT GTTGCTTTAA CTGGCCTAAC ACTGGCAGAA TATTTTCGAG 650 

ATGTTGAAGG TCAAGACGTA TTATTATTTA TTGATAATAT TTTCCGTTTT 7 00 

ACACAAGCAG GTACCGAAGT TTCCGCTTTA CTTGGTAGAA TGCCATCTGC 7 50 

TGTTGGCTAT CAACCCACAC TAGCAACTGA AATGGGGCAA CTGCAAGAAC 800 

GGATTACGTC AACGGATAAG GG 822 



2) INFORMATION FOR SEQ ID NO: 1866 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1866 
ATCTCCTCAG GATCKATAGG ACTTGATATA GCTCTTGGTA TAGGCGGCGT 50 

978 
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PCT/CAOO/01150 



ACCAAAAGGA AGAATAGTCG 
CAACTCTTAC TTTGCATTTA 
TGCGCGTTTG TAGATGCAGA 
TTTAGGCGTT GATACGGATA 
5 AGCAAGCTCT TGATATAGTA 
CTTATAGTAA TAGATAGCGT 
AGGCGATATG GGAGATCAGC 
AAGCACTTAG AAAATTAACC 
GTATTTATAA ACCAAATTCG 

10 TCCTGAAACT ACTACTGGCG 
GACTTGACGT ACGTAAAATA 
GGAAACCGCG TAAAAGTAAA 
TAGACAAGCC GAATTTGATA 
GAGAGATAAT AGATTACGGC 

15 GCTTGGTTTA GCTATGATAA 
AAAAGCGTTT TTAAAAGA . 



AAATTTATGG GCCAGAAAGC 
ATAGCAGAAT CTCAAAAAGT 
GCATGCACTT GATGTTAAAT 
ACTTATATAT TTCTCAACCG 
GAAACTCTAG CTAGAAGCGG 
AGCAGCTYTA ACACCAAAAA 
ACGTAGGGCT GCAAGCAAGA 
GGAGTTGTCC ATAAAATGGG 
TATGAAAATC GGCGCTATGG 
GAAATGCGCT TAAATTTTAC 
GCTACTTTAA AACAGAGCGA 
AGTAGTAAAA AACAAAGTCG 
TCATGTTTGG AGAAGGTATC 
GTAAAACTTG ATATTATCGA 
TTCAAAATTA GGTCAAGGCA 



TCTGGTAAAA 


100 


CGGCGGAGTT 


150 


ATGCTAAAAA 


200 


GACTTCGGAG 


250 


CGCCGTTGAT 


300 


GCGAAATAGA 


350 


CTCATGAGTC 


400 


AACTACAGTT 


450 


GCTATGGCAC 


500 


GCTTCAGTTA 


550 


TGAGCCAATC 


600 


CTCCTCCTTT 


650 


AGCAAAGAAG 


700 


TAAAAGCGGC 


750 


GAGAAAACTC 


800 




818 



20 2) INFORMATION FOR SEQ ID NO: 1867 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1867 



TCTCCTCAGG ATCKATAGGA CTTGATATAG CTCTTGGTAT AGGCGGCGTA 50 

CCAAAAGGAA GAATAGTCGA AATTTATGGG CCAGAAAGCT CTGGTAAAAC 100 

AACTCTTACT TTGCATTTAA TAGCAGAATC TCAAAAAGTC GGCGGAGTTT 150 

GCGCGTTTGT AG AT GC AG AG CATGCACTTG ATGTTAAATA TGCTAAAAAT 200 

4 0 TTAGGCGTTG ATACGGATAA CTTATATATT TCTCAACCGG ACTTCGGAGA 250 

GCAAGCTCTT GATATAGTAG AAACTCTAGC TAGAAGCGGC GCCGTTGATC 300 

TTATAGTAAT AGATAGCGTA GCAGCTYTAA CACCAAAAAG CGAAATAGAA 350 

GGCGATATGG GAGATCAGCA CGTAGGGCTG CAAGCAAGAC TCATGAGTCA 400 

AGCACTTAGA AAATTAACCG GAGTTGTCCA TAAAATGGGA ACTACAGTTG 450 

4 5 TATTTATAAA CCAAATTCGT ATGAAAATCG GCGCTATGGG CTATGGCACT 500 

CCTGAAACTA CTACTGGCGG AAATGCGCTT AAATTTTACG CTTCAGTTAG 550 

ACTTGACGTA CGTAAAATAG CTACTTTAAA ACAGAGCGAT GAGCCAATCG 600 

GAAACCGCGT AAAAGTAAAA GTAGTAAAAA ACAAAGTCGC TCCTCCTTTT 650 

AGACAAGCCG AATTTGATAT CATGTTTGGA GAAGGTATCA GCAAAGAAGG 700 

50 AGAGATAATA GATTACGGCG TAAAACTTGA TATTATCGAT AAAAGCGGCG 750 

CTTGGTTTAG CTATGATAAT TCAAAATTAG GTCAAGGCAG AGAAAACTCA 800 

AAAGCGTTTT TAAA 814 
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2) INFORMATION FOR SEQ ID NO: 1868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33560 , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1868 

GATAGCATAG GCWCAGGTTC AGTTGGACTT GATCTTGCTT TAGGTATAGG 50 

CGGTGTTCCA AAAGGAAGAA TTATAGAAAT TTATGGGCCT GAAAGTTCAG 100 

GTAAAACTAC TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT 150 

GGGGTTTGTG CTTTTATCGA TGCAGAACAT GCACTTGATG TGAAATATGC 200 

TAAAAATTTA GGTGTAAATA CAGATGATTT GTATGTTTCT CAACCTGATT 250 

TTGGAGAGCA AGCCTTAGAA ATTGTAGAAA CTATAGCWAG AAGTGGTGCA 300 

GTAGATCTTA TWGTAGTAGA TAGCGTTGCA GCWCTTACCC CAAAAGCAGA 350 

AATTGAAGGC GATATGGGCG ATCARCATGT AGGACTTCAA GCAAGACTTA 400 

TGTCTCAAGC TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC 450 

ACAGTAATTT TCATCAACCA AATTCGTATG AAAATCGGTG CTATGGGTTA 500 

TGGTACTCCT GAAACCACAA CAGGTGGAAA TGCATTAAAA TTTTATGCTT 550 

CTGTGCGTTT AGATGTTAGA AAAGTAGCAA CCTTAAAMCA AAACGWAGAM 600 

CCTATAGGAA ACCGCGTTAA AGTAAAAGTA GTTAAAAATA AAGTTGCTCC 650 

TCCATTCAGM CAAGCTGAAT TTGATGTGAT GTTTGGAGAG GGTTTAAGCC 700 

GTGAAGGTGA ATTGATCGAT TATGGTGTAA AACTTGATAT CGTAGATAAA 750 

AGTGGTGCGT GGTTTTCTTA TAAAGATAAA AAACTTGGAC AAGGTAGAGA 800 

AAATTCAAAA GCTTTCTTAA AAGA 824 



2) INFORMATION FOR SEQ ID NO: 1869 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1869 

AGAAAGTTCT GGTAAAACAA CGGTTGCACT GCATGCGATT GCAGAAGTTC 50 
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AAAAACATGG CGGGACGGCA 
CCTCAATACG CACAACGTCT 
ACAACCAGAT ACTGGGGAAC 
CAAGTGGCGC AGTCGATATT 
5 CCCCGTGCTG AAATCGATGG 
GGCTCGTTTG ATGTCACAAG 
AAACAAAGAC TATCGCCGTC 



GCCTTTATTG ATGCCGAGCA 
AGGTGTAAAC ATTGATGAAT 
AAGGCTTAGA AATTGCAGAT 
ATCGTTATTG ACTCGGTGGC 
CGAGATGGGT GATGCGCACG 
CATTGCGCAA GCTGTCAGGC 
TTTATTAACC AAATTCGT 



PCT7CA00/01150 

CGCGTTGGAC 100 

TGCTGCTATC 150 

GCTTTAGTTT 200 

CGCGCTAGTC 250 

TTGGTCTGCA 300 

TCTATCAACA 350 
388 



10 



20 



15 



2) INFORMATION FOR SEQ ID NO: 1870 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entezococcus faecium 

(B) STRAIN: ATCC 19434 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1870 

TGAAAGTTCA GGTAAAACAA CAGTTGCACT ACACGCTATT GCAGAAGTAC 50 

AAAAAAATGG CGGAACGGCC GCTTTCATTG ATGCTGAGCA TGCGTTAGAT 100 

CCGCAATATG CACAAAAATT AGGTGTGAAT ATCGATGAAC TACTTCTTTC 150 

30 ACAGCCTGAC ACAGGAGAAC AAGGTCTAGA GATCGCTGAT GCTTTAGTAT 200 

CAAGTGGGGC TGTAGATATC GTAGTAGTCG ATTCAGTTGC TGCTTTAGTT 250 

CCACGAGCAG AAATCGACGG CGAAATGGGT GACTCACATG TCGGGTTACA 300 

AGCACGTTTG ATGTCTCAAG CATTGCGTAA ACTCTCTGGT TCGATCAACA 350 

AAACAAAAAC AATCGCTATT TTCATCAACC AAATCCGT 388 

B5 



2) INFORMATION FOR SEQ ID NO: 1871 

}0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 
50 (B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1871 
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AGAGAGTTCC 
AAGCACAAGG 
CCGGCTTATG 
TCAACCAGAT 
5 GAAGTGGTGC 
CCACGTGCTG 
AGCACGTTTA 
AATCAAAAAC 



GGTAAAACAA 
CGGAACAGCA 
CTAAAAACCT 
ACAGGAGAAC 
AGTTGATATG 
AAATCGAAGG 
ATGTCCCAAG 
CATTGCTATT 



CTGTTGCGCT 
GCATTTATCG 
AGGTGTAAAT 
AAGCTTTAGA 
TTAGTAATTG 
CGAGATGGGC 
CATTGCGTAA 
TTCATTAACC 



TCATGCAATT 
ATGCTGAGCA 
ATTGATGAAT 
GATTGCTGAA 
ACTCCGTTGC 
GATGCTCATG 
ACTTTCTGGT 
AAATTCGT 



GCGGAAGTAC 50 

TGCGTTGGAT 100 

TATTACTATC 150 

GCTTTAGTTA 200 

AGCACTTGTA 250 

TTGGATTACA 300 

GTTATTAATA 350 
388 



10 



2) INFORMATION FOR SEQ ID NO: 1872 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1872 



AGAGTCATCT 
AAAAAGAAGG 

30 CCAGCTTATG 
ACAACCAGAC 
ACTCAGGAGC 
CCTCGTGCGG 
GGCTCGTATG 

35 AAACCAAAAC 



GGTAAGACAA 
TGGTATTGCT 
CTGCGGCCCT 
TCAGGAGAGC 
CGTGGATCTT 
AAATTGATGG 
ATGAGCCAGG 
AATTGCCATC 



CGGTTGCCCT 
GCCTTTATCG 
TGGTGTCAAC 
AAGGTCTTGA 
GTCGTAGTCG 
AGATATCGGT 
CTATGCGTAA 
TTTATCAACC 



TCATGCAGTT 
ATGCGGAACA 
ATTGACGAAT 
GATTGCAGGA 
ACTCAGTTGC 
GATAGCCACG 
ACTTGGTGCT 
AATTGCGT 



GCGCAAGCAC 50 

TGCCCTTGAT 100 

TGCTCTTGTC 150 

AAATTGATTG 200 

GGCCCTTGTC 250 

TTGGTTTGCA 300 

TCTATCAATA 350 
388 



2) INFORMATION FOR SEQ ID NO: 1873 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1873 

982 
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GAACATGACG CCGACTTTTT 
TTTTGGTTTT ATTGATAGAA 
ATTCGAGCCT GCAAACCAAC 
CGCACGAGGT ACAAGGGCCG 
CACCTGAGTC AATCAATTTT 
TCTGGTTGTG ACAAGAGCAA 
ATAGGCTGGG TCAAGAGCAT 
CTTCTTTCTG TGCTTGCGCA 
GATGATTCTG GCGCRTACAY 



CACGCAATTG GTTGATAAAG ATGGCAATTG 
GCTCCGAGTT TACGCATGGC CTGGCTCATC 
GTGACTGTCT CCAATATCCC CATCAATTTC 
CAACTGAGTC GATAACGACA AGGTCAACTG 
CCAGCAATTT CAAGACCTTG TTCACCTGAG 
TTCGTCAATA TTCACACCAA GGGCTGCAGC 
GTTCCGCATC GATAAAGGCT GCAATACCAC 
ACAGCGTGAA GGGCAACCGT TGTCTTACCA 
TTCGATGATA 



50 
100 
150 
200 
250 
300 
350 
400 
430 



LO 



L5 



20 



25 



30 



55 



10 



15. 



2) INFORMATION FOR SEQ ID NO: 1874 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 6474 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1874 

TTGTCGTTGT TGCTGCCTCC GACGGTCAGA TGTAGGTGGA ACATCTTGGG 50 

AAATACGTCG TAAAACACGT CGCTTACGTT TTCGCGAATA GGCCCCAGAC 100 

TCGTGAGCAT TTGCTGCTCG CCCGCCAGGT TGGTGTCCAG AAGATCGTTG 150 

TCTTCGTCAA CAAAATCGAT GCTATTGATG ATCCGGAGAT GCTGGAACTG 200 

GTCGAACTCG AGATGCGTGA GCTGCTGAAC AGCTACGGTT TCGAGGGTGA 250 

AGAGACTCCG ATCATTTTCG GTTCCGCTCT CTGTGCTCTC GAAGGACGCC 300 

GTGACGACAT CGGTAAAGAC AGAATTGAGC AGCTTATGAA CGCTGTCGAC 350 

ACCTGGATCC CCACTCCTCA GCGTGACCTC GACAAACCTT TCTTGATGTC 4 00 

TGTCGAGGAA GTGTTCTCTA TCGCCGGCCG TGGTACCGTG GCTTCTGGTC 4 50 

GTGTCGAGCG TGGTATCTTG AAGAAGGACT CTGAGGTTGA GATTGTTGGA 500 

GGCTCCTTCG AACCCAAGAA GACCAAAGTC ACCGACATTG AAACCTTCAA 550 

GAAGAGCTGT GATGAATCGC GTGCTGGTGA CAACTCTGGT CTCCTCCTGC 600 

GTGGTATCCG ACGTGAAGAC GTCAAGCGTG GTATGGTCAT TGCTGTTCCC 650 

GGCAGCACCA AGGCTCACGA CAAGTTCCTC GTCTCCATGT ACGTCCTGAC 700 

CGAGGCGGAG GGTGGTCGTC GTACTGGCTT CGGTGCCAAC TACCGTCCCC 750 

AAGTCTTCAT CCGTACTGCA GGTAAGTTCC CGCACACCGT GTCCAGATCT 800 

TCCGAGAGAT TAGCGATATA TGCTAATGAT TCATCAGACG AGGCTGCTGA 850 

CCTCAGCTTC CCTGACGGCG ACCAATCTCG CAGAGTTATG CCTGGTGACA 900 

ACGTCGAGAT GATCCTGAAG ACCCACCACC CTGTTGCTGC TGAGGCT 947 



2) INFORMATION FOR SEQ ID NO: 1875 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ, ID NO: 1875 

GCTGCTTCCG ATGGTCAAAT GTACGTCAAC CTTATATACA CCCTCTGATA 50 

TGACAGAATG TCGCCTAACA GCACGCGTGA ACTAGGCCCC AAACCCGTGA 100 

GCACTTGCTG CTTGCCCGCC AGGTCGGTGT CCAGAAGATT GTTGTGTTCG 150 

TCAACAAGGT TGATGCCGTC GATGACCCTG AGATGTTGGA ACTTGTTGAG 200 

CTGGAAATGC GTGAGCTTCT CAGCACTTAC GGCTTCGAAG GCGAGGAGAC 250 

CCCTATCATC TTCGGTTCCG CCCTATGCGC CCTCGAGGGT CGCCGCCCCG 300 

ATATCGGTAC TGAGCGAATT GACAGCCTTC TTGAGGCCGT TGACACCTGG 350 

ATCCCTACCC CTCAGCGTGA CCTGGACAAG CCTTTCCTGA TGTCTGTCGA 400 

GGAGGTCTTC TCCATTGCCG GTCGTGGTAC CGTTGCCTCT GGCCGTGTTG 450 

AGCGTGGTCT CCTTAAGAAG GACAGCGAGG TCGAGATTCT CGGAGGTGGT 500 

CAGGTCATGA AGACCAAGGT CACTGACATT GAGACATTCA AGAAGCACTG 550 

TGACGAATCC CGTGCTGGTG ACAACTCCGG TCTTCTTCTC CGTGGTATCC 600 

GCCGTGAGGA TGTCAAGCGC GGTATGGTTA TTGCTGCTCC CGCCTCTATC 650 

AAGGCCCACA AGAAGTTCAT GGTCTCCATG TACGTCCTCA CTGAGGCAGA 700 

AGGTGGCCGT CGCAGTGGCT TCGGTGTCAA CTATCGTCCC CAGGCTTACA 7 50 

TTCGCACTGC TGGTAAGTTT TCAAAACATT CGACCTCTCG CCTTAGAAGA 800 

AGAATACCTC TAACTTGTAT TTAGACGAGG CTTGCGACCT TTCTTTCCCC 850 

GATGGCGACA TGAGCCGCCG TGTCATGCCT GGTGACAACG TGGAAATGAT 900 

CCTCAACCTC AACAACCCTG TCG 923 



2) INFORMATION FOR SEQ ID NO: 1876 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1876 

CATCATTGTT GTGTCCGCCA CTGATGGTCA AATGCCTCAA ACTCGTGAGC 50 
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ATTTGTTGTT GGCTCGCCAA GTTGGTGTCC 
AACAAGGTTG ATGCCGTTGA TGATCCTGAA 
GGAAATGCGT GATTTGCTTT CCCAATACGG 
CCATTATCCA GGGTTCCGCT CTCTGCGCTC 
5 ATCGGCCGTA ACGCCATCAT GAAGTTGATG 
CCCCACCCCT GCCCGTGATT TGGACAAACC 
ATGTGTTCTC CATCTCTGGC CGTGGTACTG 
CGTGGTATGG TCACCAAGGG TACTGAAGTT 
GCACTTCAAG ACCACCTTGA CCGGTATTGA 

LO ACAAGGGTAT GGCTGGTGAC AACATGGGTT 
CGTGAGCAAG TCCGTCGTGG TATGGTTATC 
GCCACATAAG AAGTTCATGG CTCAGCTCTA 
GAGGCCGCCA CACTCCCTTC GTCAACAACT 
AGAACTGTTG ATGTTACCGC CATCCTTAAG 

L5 TGCTGATGAG AAGATGGTCA TGCCCGGAGA 
AGCTCTT 



PCT/CA00/01150 

AACACTTGGT TGTCTTCATC 100 

ATGTTGGAGT TGGTCGAGAT 150 

TTTCCCCGGA GACAACGTCC 200 

TTGAGGACCG CAACCCCGAG 250 

GAGGCTGTTG ATAGCAGCAT 300 

TTTCCTCATG CCCGTTGAAG 350 

TTGCCACTGG ACGTGTTGAG 400 

GAAATCGTCG GTATGGGCGA 450 

AATGTTCCAC AAGGAATTGG 500 

GCTTGCTTCG TGGTGTCAAG 550 

TGTGCCCCCG GATCCGTCAA 600 

CATTCTCACC AAGGATGAGG 650 

ACCGCCCACA AATGTTCTTC 700 

CACCCCCCTG GTACCCCCGA 750 

CAACGTTCAA CTCGAGTGCG 800 

807 



20 2) INFORMATION FOR SEQ ID NO: 1877 



15 



10 



J5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 

(B) STRAIN: ATCC 33236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1877 



AGTTGTTTCT 
TGCTTTCTCG 
ACCGATATGG 
TAGAGATCTT 

10 TTAAGGGTTC 
GGCGAATGGT 
TATTCCAACT 
AAGATATTTT 
GAAAAAGGTA 

15 ACCTACTCAG 
TGGATCAAGG 
AAGAAAGAGG 
CACTCCTCAT 
AAGGCGGACG 

50 GTTCGTACGA 
GATGGTTATG 
CGATCG 



GCTGCGGATG 
TCAAGTAGGC 
TCGATGATCC 
TTAAAAGAGT 
TGCTCTTAAG 
CTGCAAAGAT 
CCTGTTCGCG 
CTCGATTTCC 
TCGTTAAAGT 
ACTACTACCG 
TGAAGCCGGC 
AAGTAGAGCG 
ACTAAATTTG 
CCATACTCCA 
CAGATGTTAC 
CCGGGCGACA 



GTCCTATGCC 
GTTCCATACA 
GGATCTTTTA 
ATAAATTCCC 
GCTCTTGAGG 
TATGGAGCTT 
ATACTGATAA 
GGTCGCGGTA 
TGGTGATACT 
TCACTGGCGT 
GATAATGTAG 
CGGTATGGTT 
AGGGCGAGGT 
TTCTTTAATA 
CGGTTCGATT 
ACGTTAAAAT 



TCAAACTCGC 
TCGTAGTTTT 
GAGTTAGTTG 
TGGCGACGAA 
AAGCTAAGGC 
ATGGACGCGG 
AGATTTCCTT 
CCGTTGTAAC 
ATCGAGATCG 
TGAGATGTTT 
GTGTTTTATT 
TTATGCAAAC 
TTATATCCTA 
ATTATAGACC 
ACTCTTCCTG 
CACCGTTGAG 



GAGCATATCT 50 

CCTAAACAAA 100 

AAGAGGAAGT 150 

ACCCCAATCA 200 

CGGACAAGAC 250 

TTGATAGCTA 300 

CTTCCGATCG 350 

CGGTAGAATC 400 

TAGGTATTAA 450 

AGAAAAGAGA 500 

GCGCGGTACT 550 

CAAAATCGAT 600 

ACTAAAGAAG 650 

GCAGTTTTAC 700 

AAGGAACCGA 750 

CTAATCGCTC 800 
806 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1878 

GCTGCAGATG GCCCTATGCC ACAAACTAGA GAGCACATTC TTCTTTCTCG 50 

TCAAGTAGGC GTTCCATATA TTGTTGTTTT TATGAATAAA GCAGATATGG 100 

TTGATGATGC TGAACTTTTA GAGTTAGTTG AAATGGAAAT TAGAGAATTA 150 

TTAAGCTCTT ATGATTTCCC AGGCGATGAT ACACCTATTA TTTCTGGTTC 200 

TGCTTTAAAA GCTCTTGAAG AAGCTAAAGC TGGACAAGAT GGTGAATGGT 250 

CAGCAAAAAT TATGGATCTT ATGGCTGCAG TTGATAGCTA TATTCCAACT 300 

CCAACTCGTG ATACTGAAAA AGACTTCTTG ATGCCAATTG AAGATGTTTT 350 

CTCAATTTCA GGTCGTGGTA CTGTTGTTAC AGGTAGAATT GAAAAAGGTG 4 00 

TTGTAAAAGT AGGTGATACT ATCGAAATCG TTGGTATTAA AGATACTCAA 450 

ACAACAACTG TAACAGGTGT TGAAATGTTC AGAAAAGAAA TGGATCAAGG 500 

CGAAGCAGGA GATAACGTAG GTGTTCTTCT TCGTGGTACT AAAAAAGAAG 550 

AAGTTATCCG TGGTATGGTT CTTGCTAAAC CAAAATCAAT TACTCCACAC 600 

ACTGACTTCG AAGCTGAAGT TTATATCTTA AATAAAGATG AAGGTGGTAG 650 

ACATACTCCA TTCTTTAACA ACTATAGACC ACAGTTTTAT GTAAGAACAA 700 

CTGATGTTAC AGGTTCGATT AAATTAGCTG ATGGTGTTGA AATGGTTATG 750 

CCAGGTGAAA ATGTGAGAAT TACTGTAAGC TTGATCGCTC CAGTAGCACT 800 

TGAAGA 806 



2) INFORMATION FOR SEQ ID NO: 1879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1879 

ATGTATGCAA CCGAGAGCAC TCCCGGATCT TGGTTTAAAT GGCACTAATA 50 

986 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



TAAGACAGGC CTCAAACTCG AGAGCATTTA CTTCTCGCCC GTCAGATCGG 100 

TATCCAAAAA ATCGTCGTCT TCGTGAACAA GGTTGATGCC ATCGAGGACA 150 

MGAGATGTT GGAGCTTGTT GAATTGGAGA TGCGTGAACT CCTAACCAGC 200 

TACGGTTTCS AGGGTGAAGA AACTCCCATC ATTTTTGGCT CTGCTCTCTG 250 

5 TGCCCTCSAA GGAAGACAAC CCGAGATCGG TGTTACCAAG ATTGATGAGC 300 

TCTTGCAGGC CGTCGACACC TGGATTCCCA CTCCTCAGCG TGAGACTGAC 350 

AAGCCCTTCT TGATGTCCAT TGAGGAAGTG TTCTCTATTT CCGGACGAGG 4 00 

AACCGTTGTC TCCGGCCGTG TGGAGCGTGG TATCCTCAAG AAGGACTCCG 4 50 

AAGTTGAAAT TGTCGGCGGT TCGCCCGAGC CAATCAAAAC CAAGGTTACC 500 

LO GATATCGAAA CCTTTAAGAA GTCTTGCGAC GAGTCTCGCG CTGGTGATAA 550 

CTCCGGCTTG CTCCTACGAG GCGTTAAGCG TGAAGATATT AGCCGTGGCA 600 

TGGTCGTCGC TGTACCAGGA AGTGTCAAGG CCCATACTGA ATTCTTAGTT 650 

TCGCTTTACG TCCTCACCGA AGCTGAGGGT GGGCGCAAAT CTGGATTCAG 700 

CAGCAAGTAC CGCCCACAGA TGTTCATTCG CACTGCCGGT ATGTAATACT 750 

L5 GTGATAATTT CGTTGACATG GTACTGATTG AATTCTATAG ACGAAGCGGC 800 

TCAGCTCAGC TGGCCCGGAG. AAGATCAAGA CAAGATGGCT ATGCCAGGAG 850 

ACAATATCGA AATGATTTGC ACCACCTTGC ACCCAGTTGC CGCCGA 8 96 



20 



2) INFORMATION FOR SEQ ID NO: 1880 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14976 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1880 



CTGGTAGTTG 
CCTGCTGGGT 
AATGTGACAT 

10 GTMCGTGACC 
CGTGCRYGGT 
CGAAGATCAT 
GAGCGTGCGA 
CATCTCTGGC 

15 TTAAAGTGGG 
TCAACCTGTA 
TGCGGGTGAG 
TCCAGCGTGG 
AAGTTCGAGT 

50 TACTCCGTTC 
ACGTGACCGG 
GGCGACAACA 



CTGCGACTGA 
CGCCAGGTTG 
GGTTGATGAT 
TGCTGTCACA 
TCTGCGCTGA 
CGAACTGGCT 
TTGACAAACC 
CGTGGTACCG 
TGAAGAAGTT 
CCGGCGTTGA 
AACTGTGGTA 
TCAGGTTCTG 
CAGAAGTTTA 
TTCAAAGGCT 
GACTATCGAA 
TTCAGATGGT 



CGGCCCAATG 
GCGTGCCATA 
GAAGAGCTGC 
GTACGACTTC 
AAGCGCTGGA 
GGCCATCTGG 
GTTCCTGCTG 
TTGTTACCGG 
GAAATCGTTG 
GATGTTCCGT 
TCCTGCTGCG 
GCGAAGCCAG 
TATTCTGTCT 
ACCGTCCACA 
CTGCCAGAAG 
TGTGACCCTG 



CCTCAGACYC 


GTGAGCACAT 


50 


CATCATCGTG 


TTCCTGAACA 


100 


TGGAGCTGGT 


TGAAATGGAA 


150 


CCAGGCGACG 


ACACGCCAAT 


200 


RGGCGAAGCA 


GAGTGGGAAG 


250 


ATAACTACAT 


CCCGGAACCA 


300 


CCAATTGAAG 


ACGTGTTCTC 


350 


TCGTGTAGAG 


CGCGGTRTSG 


400 


GTATCAAAGA 


TACCGTGAAA 


450 


AAGCTGCTGG 


ACGAAGGCCG 


500 


CGGTATCAAG 


CGCGAAGATA 


550 


GCACCATCAA 


GCCACACACC 


600 


AAAGACGAAG 


GCGGCCGTCA 


650 


GTTCTACTTC 


CGTACTACTG 


700 


GCGTTGAGAT 


GGTGATGCCA 


750 


ATCCACCCGA 


TCGCCATG 


798 
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2) INFORMATION FOR SEQ ID NO: 1881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1881 

ATCCTGGTTG TTGCTGCGAC TGACGGYCCG ATGCCGCAGA CCCGTGAGCA 50 

CATCCTGCTG GGTCGTCAGG TAGGCGTTCC GTACATCATC GTGTTCCTGA 100 

ACAAATGCGA CATGGTTGAT GACGAAGAGC TGCTGGAACT GGTTGAAATG 150 

GAAGTTCGYG AACTGCTGTC TCAGTACGAC TTCCCGGGCG ACGACACTCC 200 

GATCGTTCGT GGTTCTGCTC TGAAAGCGCT GGAAGGCGAC GCAGAGTGGG 250 

AAGCGAAAAT CATCGAACTG GCTGGCTTCC TGGATTCTTA CATYCCGGAA 300 

CCAGAGCGTG CGATTGACAA GCCGTTCCTG CTGCCGATCG AAGACGTATT 350 

CTCCATCTCC GGTCGTGGTA CCGTTGTTAC CGGTCGTGTA GARCGCGGTA 4 00 

TCATCAAAGT GGGCGAAGAA GTTGAAATCG TTGGTATCAA AGAGACTCAG 450 

AAGTCTACCT GTACTGGCGT TGAAATGTTC CGCAAACTGC TGGACGAAGG 500 

CCGTGCCGGT GAGAACGTAG GTGTTCTGCT GCGTGGTATC AAACGTGAAG 550 

AAATCGAACG TGGTCAGGTA CTGGCTAAGC CGGGCACCAT CAAGCCGCAC 600 

ACCAAGTTCG AATCTGAAGT GTACATTCTG TCCAAAGATG AAGGCGGCCG 650 

TCATACTCCG TTCTTCAAAG GCTACCGTCC GCAGTTCTAC TTCCGTACTA 7 00 

CTGACGTGAC TGGCACCATC GAACTGCCGG AAGGCGTAGA GATGGTAATG 7 50 

CCGGGCGACA ACATCAAAAT GGTTGTTACC CTGATCCACC CGATCGCGAT 800 

GGACGACGGT 810 



2) INFORMATION FOR SEQ ID NO: 1882 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: BM10711 

(C) ACCESSION NUMBER : AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1882 
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ATGAATTTTT ATTTAGAGGA GTTTAACTTG TCTATTCCCG ATTCAGGTCC 50 

ATACGGTATA ACTTCATCAG AAGACGGAAA GGTATGGTTC ACACAACATA 100 

AGGCAAACAA AATCAGCAGT CTAGATCAGA GTGGTAGGAT AAAAGAATTC 150 

5 GAAGTTCCTA CCCCTGATGC TAAAGTGATG TGTTTAATTG TATCTTCACT 200 

TGGAGACATA TGGTTTACAG AGAATGGTGC AAATAAAATC GGAAAGCTCT 250 

CAAAAAAAGG TGGCTTTACA GAATATCCAT TGCCACAGCC GGATTCTGGT 300 

CCTTACGGAA TAACGGAAGG TCTAAATGGC GATATATGGT TTACCCAATT 350 

GAATGGAGAT CGTATAGGAA AGTTGACAGC TGATGGGACT ATTTATGAAT 4 00 

10 ATGATTTGCC AAATAAGGGA TCTTATCCTG CTTTTATTAC TTTAGGTTCG 4 50 

GATAACGCAC TTTGGTTCAC GGAGAACCAA AATAATTCTA TTGGAAGGAT 500 

TACAAATACA GGGAAATTAG AAGAATATCC TCTACCAACA AATGCAGCGG 550 

CTCCAGTGGG TATCACTAGT GGTAACGATG GTGCACTCTG GTTTGTCGAA 600 

ATTATGGGCA ACAAAATAGG TCGAATCACT ACAACTGGTG AGATTAGCGA 650 

15 ATATGATATT CCAACTCCAA ACGCACGTCC ACACGCTATA ACCGCGGGGA 700 

AAAATAGCGA AATATGGTTT ACTGAATGGG GGGCAAATCA AATCGGCAGA 750 

ATTACAAACG ACAAAACAAT TCAAGAATAT CAACTTCAAA CAGAAAATGC 800 

GGAACCTCAT GGTATTACCT TTGGAAAAGA TGGATCCGTA TGGTTTGCAT 850 

TAAAATGTAA AATTGGGAAG CTGAATTTGA ACGAATGA 888 

20 



2) INFORMATION FOR SEQ ID NO: 1883 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1883 
35 AGCCGCTTGA GCAAATTAAA CTA 23 



2) INFORMATION FOR SEQ ID NO: 1884 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1884 

50 

GTATCCCGCA GATAAATCAC CAC 23 
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2) INFORMATION FOR SEQ ID NO: 1885 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1885 
AGCGAAAAAC ACCTTGCCGA C 



2) INFORMATION FOR SEQ ID NO: 1886 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1886 
GACGCCCGCG CCACCACT 



2) INFORMATION FOR SEQ ID NO: 1887 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1887 
GACGCCCGCG ACACCACTA 



2) INFORMATION FOR SEQ ID NO: 1888 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1888 
GACGCCCGCA ACACCACTA 19 



2) INFORMATION FOR SEQ ID NO: 1889 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1889 
GTTCGCAACT GCAGCTGCTG 20 

25 

2) INFORMATION FOR SEQ ID NO: 1890 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1890 
40 TTCGCAACGG CAGCTGCTG 19 



2) INFORMATION FOR SEQ ID NO: 1891 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 ( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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CCGGAGCTGC CGAICGGG 
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2) INFORMATION FOR SEQ ID NO: 1892 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1892 
CGGAGCTGCC AARCGGGG 



2) INFORMATION FOR SEQ ID NO: i893 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1893 
GGAGCTGGCG ARCGGGGT 



2) INFORMATION FOR SEQ ID NO: 1894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1894 
GACCGGAGCT AGCGARCG 
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2) INFORMATION FOR SEQ ID NO: 1895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1895 
CGGAGCTAGC AARCGGGGT 



2) INFORMATION FOR SEQ ID NO: 1896 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1896 
GAAACGGAAC TGAATGAGGC G 



2) INFORMATION FOR SEQ ID NO: 1897 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1897 
CATTACCATG GGCGATAACA G 



2) INFORMATION FOR SEQ ID NO: 1898 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1898 
CCATTACCAT GAGCGATAAC AG 



2) INFORMATION FOR SEQ ID NO: 1899 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 15571 

(C) ACCESSION NUMBER: AF124984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1899 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 



2) INFORMATION FOR SEQ ID NO: 1900 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 
10 (B) STRAIN: SLK-47 

(C) ACCESSION NUMBER: Y11069 

(xi) SEQUENCE DESCRIPTION :' SEQ ID NO: 1900 

15 CTGTTAGCCA CCCTGCCGCT GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 50 

GCAAATTAAA CTAAGCGAAA GCCAGCTGTC GGGCCGCGTA GGCATGATAG 100 

AAATGGATCT GGCCAGCGGC CGCACGCTGA CCGCCTGGCG CGCCGATGAA 150 

CGCTTTCCCA TGATGAGCAC CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 200 

GGCGCGGGTG GATGCCGGTG ACGAACAGCT GGAGCGAAAG ATCCACTATC 250 

20 GCCAGCAGGA TCTGGTGGAC TACTCGCCGG TCAGCGAAAA ACACCTTGCC 300 

GACGGCATGA CGGTCGGCGA ACTCTGCGCC GCCGCCATTA CCATGAGCGA 350 

TAACAGCGCC GCCAATCTGC TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 400 

TGACTGCCTT TTTGCGCCAG ATCGGCGACA ACGTCACCCG CCTTGACCGC 450 

TGGGAAACGG AACTGAATGA GGCGCTTCCC GGCGACGCCC GCGCCACCAC 500 

2 5 TACCCCGGCC AGCATGGCCG CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 550 

GTCTGAGCGC CCGTTCGCAA CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 600 

CGGGTCGCCG GACCGTTGAT CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 650 

CGCCGATAAG ACCGGAGCTG GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 700 

TGCTTGGCCC GAATAACAAA GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 7 50 

30 GATACCCCGG CGAGCATGGC CGAGCGAAAT 780 



2) INFORMATION FOR SEQ ID NO: 1901 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: U92041 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1901 

50 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 
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CTTTAAAGTA 
ACGAACAGCT 
TACTCGCCGG 
ACTCTGCGCC 
TACTGGCCAC 
ATCGGCGACA 
GGCGCTTCCC 
CGACCCTGCG 
CGGCAGCTGC 
CCGCTCCGTG 
GCGAGCGGGG 
GCAGAGCGCA 
CGAGCGAAAT 
GGCAACGCTA 



GTGCTCTGCG 
GGAGCGAAAG 
TCAGCGAAAA 
GCCGCCATTA 
CGTCGGCGGC 
ACGTCACCCG 
GGCGACGCCC 
CAAGCTGCTG 
TGCAGTGGAT 
CTGCCGGCGG 
TGCGCGCGGG 
TTGTGGTGAT 
CAGCAAATCG 
A 



GCGCAGTGCT 
ATCCACTATC 
ACACCTTGCC 
CCATGAGCGA 
CCCGCAGGAT 
CCTTGACCGC 
GCAACACCAC 
ACCAGCCAGC 
GGTGGACGAT 
GCTGGTTTAT 
ATTGTCGCCC 
TTATCTGCGG 
CCGGGATCGG 



GGCGCGGGTG GATGCCGGTG 
GCCAGCAGGA TCTGGTGGAC 
GACGGCATGA CGGTCGGCGA 
TAACAGCGCC GCCAATCTGC 
TGACTGCCTT TTTGCGCCAG 
TGGGAAACGG AACTGAATGA 
TACCCCGGCC AGCATGGCCG 
GTCTGAGCGC CCGTTCGCAA 
CGGGTCGCCG GACCGTTGAT 
CGCCGATAAG ACCGGAGCTG 
TGCTTGGCCC GAATAACAAA 
GATACCCCGG CGAGCATGGC 
CGCGGCGCTG ATCGAGCACT 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 1902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 803 

(C) ACCESSION NUMBER: AF164577 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1902 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CAAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 
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2) INFORMATION FOR SEQ ID NO: 1903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 700603 

(C) ACCESSION NUMBER: AF132290 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1903 

ATGCGTTATT TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCAGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGTGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 4 00 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGGA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA ACCCGGCGGT GGCCGCGCGC GTTATCCGGC TCGTAG 896 



2) INFORMATION FOR SEQ ID NO: 1904 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: JC2926 

(C) ACCESSION NUMBER: AF148851 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1904 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

5 GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

10 ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

15 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG. GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

20 GGCAACGCTA A 861 



2) INFORMATION FOR SEQ ID NO: 1905 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Pseudomonas aeruginosa 

(C) ACCESSION NUMBER: AF096930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1905 

4 0 ATGCGTTATA TTCGCCTGTG TATTATCTCC 
GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 
GCCAGCTGTC GGGCCGCGTA GGCATGATAG 
CGCACGCTGA CCGCCTGGCG CGCCGATGAA 
CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 

45 ACGAACAGCT GGAGCGAAAG ATCCACTATC 
TACTCGCCGG TCAGCGAAAA ACACCTTGCC 
ACTCTGCGCC GCCGCCATTA CCATGAGCGA 
TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 
ATCGGCGACA ACGTCACCCG CCTTGACCGC 

50 GGCGCTTCCC GGCGACGCCC GCGACACCAC 
CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 
CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 
CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 
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CTGTTAGCCA 


CCCTGCCGCT 


50 


GCAAATTAAA 


CTAAGCGAAA 


100 


AAATGGATCT 


GGCCAGCGGC 


150 


CGCTTTCCCA 


TGATGAGCAC 


200 


GGCGCGGGTG 


GATGCCGGTG 


250 


GCCAGCAGGA 


TCTGGTGGAC 


300 


GACGGCATGA 


CGGTCGGCGA 


350 


TAACAGCGCC 


GCCAATCTAC 


400 


TGACTGCCTT 


TTTGCGCCAG 


450 


TGGGAAACGG 


AACTGAATGA 


500 


TACCCCGGCC 


AGCATGGCCG 


550 


GTCTGAGCGC 


CCGTTCGCAA 


600 


CGGGTCGCCG 


GACCGTTGAT 


650 


CGCCGATAAG 


ACCGGAGCTA 


700 
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GCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 
GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 
GGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 
GGCAACGCTA A 



750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 1906 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single j 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1906 
CCTTATTCCC TTTTTTGCGG 20 



2) INFORMATION FOR SEQ ID NO: 1907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1907 
CACCTATCTC AGCGATCTGT CT 22 



2) INFORMATION FOR SEQ ID NO: 1908 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1908 
AACAGCGGTA. AGATCCTTGA GAG 23 
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2) INFORMATION FOR SEQ ID NO: 1909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1909 
ATGACTTGGT TAAGTACTCA CC 22 

2) INFORMATION FOR SEQ ID NO: 1910 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1910 
ATGACTTGGT TGAGTACTCA CC 22 

30 

2) INFORMATION FOR SEQ ID NO: 1911 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1911 
45 CCATAACCAT GGGTGATAAC AC 22 



2) INFORMATION FOR SEQ ID NO: 1912 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

1000 
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